NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO
HELICOBACTER PYLORI FOR DIAGNOSTICS AND THERAPEUTICS 

Background of the Invention
Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that
was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and B.
Marshall, (1983) Lancet 1: 1273-1275; and Marshall et al., (1984) Microbios Lett. 25: 83-
88). H. pylori has been strongly linked to chronic gastritis and duodenal ulcer disease.
(Rathbone et al., (1986) Gut 27: 635-641). Moreover, evidence is accumulating for an
etiologic role of H. pylori in nonulcer dyspepsia, gastric ulcer disease, and gastric
adenocarcinoma. (Blaser M. J., (1993) Trends Microbiol. 1: 255-260). Transmission of the
bacteria occurs via the oral route, and the risk of infection increases with age. (Taylor,
D.N. and M. J. Blaser, (1991) Epidemiol. Rev. 13: 42-50). H. pylori colonizes the human
gastric mucosa, establishing an infection that usually persists for decades. Infection by H.
pylori is prevalent worldwide. Developed countries have infection rates over 50% of the
adult population, while developing countries have infection rates reaching 90% of the
adults over the age of 20. (Hopkins R. J. and J. G. Morris (1994) Am. J. Med. 97: 265-277).

The bacterial factors necessary for colonization of the gastric environment, and for
virulence of this pathogen, are poorly understood. Examples of the putative virulence
factors include the following: urease, an enzyme that may play a role in neutralizing gastric
acid pH (Eaton et al., (1991) Infect. Immun. 59: 2470-2475; Ferrero, R.L. and A. Lee
(1991) Microb. Ecol. Hlth. Dis. 4: 121-134; Labigne et al., (1991) J. Bacteriol. 173: 1920-
1931); the bacterial flagellar proteins responsible for motility across the mucous layer
(Hazell et al., (1986) J. Inf. Dis. 153: 658-663; Leying et al., (1992) Mol. Microbiol. 6:
2863-2874; and Haas et al., (1993) Mol. Microbiol. 8: 753-760); VacA, a bacterial toxin
that induces the formation of intracellular vacuoles in epithelial cells (Schmitt, W. and R.
Haas, (1994) Molecular Microbiol. 12(2): 307-319); and several gastric tissue-specific
adhesins (Boren et al., (1993) Science 262: 1892-1895; Evans et al., (1993) J. Bacteriol.
175: 674-683; and Falk et al., (1993) Proc. Natl. Acad. Sci. USA 90: 2035-2039).
Numerous therapeutic agents are currently available that eradicate H. pylori
infections in vitro. (Huesca et al., (1993) Zbl. Bakt. 280: 244-252; Hopkins, R. J. and J. G.
Morris, supra). However, many of these treatments are suboptimally effective in vivo
because of bacterial resistance, altered drug distribution, patient non-compliance or poor
drug availability. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics
combined with bismuth is part of the standard regimen used to treat H. pylori infection.
(Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics 15 Supp. B:
37-48). Recently, combinations of a proton pump inhibitor and a single antibiotic have
been shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E. Dominguez-
Munoz supra). However, methods employing antibiotic agents can have the problem of the
emergence of bacterial strains which are resistant to these agents. (Hopkins, R. J. and J. G.
Morris, supra). These limitations demonstrate that new, more effective methods are needed
to combat H. pylori infections in vivo. In particular, the design of new vaccines that may
prevent infection by this bacterium is highly desirable.

Summary of the Invention 

This invention relates to novel genes, e.g., genes encoding polypeptides such as
bacterial surface proteins, from the organism Helicobacter pylori (H. pylori), and other
related genes, their products, and uses thereof. The nucleic acids and peptides of the
present invention have utility for diagnostics and therapeutics for H. pylori and other
Helicobacter species. They can also be used to detect the presence of H. pylori and other
Helicobacter species in a sample, and to screen compounds for the ability to
interfere with the H. pylori life cycle or to inhibit H. pylori infection. More specifically,
this invention features compositions of nucleic acids corresponding to entire coding
sequences of H. pylori proteins, including surface or secreted proteins or parts thereof,
nucleic acids capable of binding mRNA from H. pylori proteins to block protein
translation, and methods for producing H. pylori proteins or parts thereof using peptide
synthesis and recombinant DNA techniques. This invention also features antibodies and
nucleic acids useful as probes to detect H. pylori infection. In addition, vaccine
compositions and methods for the protection or treatment of infection by H. pylori are
within the scope of this invention.

Detailed Description of the Drawings 

Figure 1 is a bar graph that depicts the antibody titer in serum of mice following
immunization with specific H. pylori antigens.

Figure 2 is a bar graph that depicts the antibody titer in mucus of mice following
immunization with specific H. pylori antigens.

Figure 3 is a bar graph that depicts therapeutic immunization of H. pylori infected
mice with specific antigens dissolved in HEPES buffer.

Figure 4 is a bar graph that depicts therapeutic immunization of H. pylori infected 
mice with specific antigens dissolved in buffer containing DOC. 

Detailed Description of the Invention 

In one aspect, the invention features a recombinant or substantially pure preparation 
of an H. pylori polypeptide of SEQ ID NO: 384. The invention also includes substantially
pure nucleic acid encoding an H. pylori polypeptide of SEQ ID NO: 384, such nucleic acid
is contained in SEQ ID NO: 1. The H. pylori polypeptide sequences of the invention
described herein are contained in the Sequence Listing, and the nucleic acids encoding H. 
pylori polypeptides of the invention are contained in the Sequence Listing. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 384 through SEQ ID NO: 389, SEQ ID NO: 391 through SEQ
ID NO: 400, SEQ ID NO: 402 through SEQ ID NO: 406, SEQ ID NO: 408, SEQ ID NO:
411 through SEQ ID NO: 412, SEQ ID NO: 414 through SEQ ID NO: 430, SEQ ID NO:
432 through SEQ ID NO: 434, SEQ ID NO: 436 through SEQ ID NO: 441, and SEQ ID
NO: 443. The invention also includes substantially pure nucleic acid encoding an H. pylori
polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 384
through SEQ ID NO: 389, SEQ ID NO: 391 through SEQ ID NO: 400, SEQ ID NO: 402
through SEQ ID NO: 406, SEQ ID NO: 408, SEQ ID NO: 411 through SEQ ID NO: 412,
SEQ ID NO: 414 through SEQ ID NO: 430, SEQ ID NO: 432 through SEQ ID NO: 434,
SEQ ID NO: 436 through SEQ ID NO: 441, and SEQ ID NO: 443, such nucleic acids are
contained in SEQ ID NO: 1 through SEQ ID NO: 50. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 444, SEQ ID NO: 446 through SEQ ID NO: 448, SEQ ID
NO: 450 through SEQ ID NO: 462, SEQ ID NO: 465 through SEQ ID NO: 466, SEQ ID
NO: 468 through SEQ ID NO: 469, SEQ ID NO: 471 through SEQ ID NO: 473, SEQ ID
NO: 475, SEQ ID NO: 478 through SEQ ID NO: 479, SEQ ID NO: 481 through SEQ ID 
NO: 484, SEQ ID NO: 486, SEQ ID NO: 488 through SEQ ID NO: 501, and SEQ ID NO: 
503 through SEQ ID NO: 506. The invention also includes substantially pure nucleic acid
encoding an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 444, SEQ ID NO: 446 through SEQ ID NO: 448, SEQ ID
NO: 450 through SEQ ID NO: 462, SEQ ID NO: 465 through SEQ ID NO: 466, SEQ ID
NO: 468 through SEQ ID NO: 469, SEQ ID NO: 471 through SEQ ID NO: 473, SEQ ID
NO: 475, SEQ ID NO: 478 through SEQ ID NO: 479, SEQ ID NO: 481 through SEQ ID
NO: 484, SEQ ID NO: 486, SEQ ID NO: 488 through SEQ ID NO: 501, and SEQ ID NO:
503 through SEQ ID NO: 506, such nucleic acids are contained in SEQ ID NO: 51 through
SEQ ID NO: 100. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 509 through SEQ ID NO: 510, SEQ ID NO: 512 through SEQ
ID NO: 514, SEQ ID NO: 516, SEQ ID NO: 518 through SEQ ID NO: 520, SEQ ID NO:
522 through SEQ ID NO: 525, SEQ ID NO: 527 through SEQ ID NO: 533, SEQ ID NO:
535 through SEQ ID NO: 537, SEQ ID NO: 539 through SEQ ID NO: 540, SEQ ID NO:
542 through SEQ ID NO: 544, SEQ ID NO: 546 through SEQ ID NO: 548, SEQ ID NO:
550, SEQ ID NO: 553 through SEQ ID NO: 556, SEQ ID NO: 558, SEQ ID NO: 560, SEQ
ID NO: 562 through SEQ ID NO: 568, SEQ ID NO: 570, and SEQ ID NO: 572 through 
SEQ ID NO: 575. The invention also includes substantially pure nucleic acid encoding an 
H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ
ID NO: 509 through SEQ ID NO: 510, SEQ ID NO: 512 through SEQ ID NO: 514, SEQ
ID NO: 516, SEQ ID NO: 518 through SEQ ID NO: 520, SEQ ID NO: 522 through SEQ
ID NO: 525, SEQ ID NO: 527 through SEQ ID NO: 533, SEQ ID NO: 535 through SEQ 
ID NO: 537, SEQ ID NO: 539 through SEQ ID NO: 540, SEQ ID NO: 542 through SEQ 
ID NO: 544, SEQ ID NO: 546 through SEQ ID NO: 548, SEQ ID NO: 550, SEQ ID NO:
553 through SEQ ID NO: 556, SEQ ID NO: 558, SEQ ID NO: 560, SEQ ID NO: 562
through SEQ ID NO: 568, SEQ ID NO: 570, and SEQ ID NO: 572 through SEQ ID NO: 
575, such nucleic acids are contained in SEQ ID NO: 101 through SEQ ID NO: 150. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 576 through SEQ ID NO: 579, SEQ ID NO: 581 through SEQ
ID NO: 583, SEQ ID NO: 585 through SEQ ID NO: 593, SEQ ID NO: 596 through SEQ
ID NO: 614, SEQ ID NO: 617 through SEQ ID NO: 623, SEQ ID NO: 625, SEQ ID NO: 
627, SEQ ID NO: 629 through SEQ ID NO: 631, SEQ ID NO: 633, and SEQ ID NO: 635 
through SEQ ID NO: 636. The invention also includes substantially pure nucleic acid
encoding an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 576 through SEQ ID NO: 579, SEQ ID NO: 581 through SEQ 
ID NO: 583, SEQ ID NO: 585 through SEQ ID NO: 593, SEQ ID NO: 596 through SEQ 
ID NO: 614, SEQ ID NO: 617 through SEQ ID NO: 623, SEQ ID NO: 625, SEQ ID NO: 
627, SEQ ID NO: 629 through SEQ ID NO: 631, SEQ ID NO: 633, and SEQ ID NO: 635
through SEQ ID NO: 636, such nucleic acids are contained in SEQ ID NO: 151 through
SEQ ID NO: 200.

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 638 through SEQ ID NO: 640, SEQ ID NO: 642 through SEQ
ID NO: 643, SEQ ID NO: 647, SEQ ID NO: 649 through SEQ ID NO: 651, SEQ ID NO:
653 through SEQ ID NO: 661, SEQ ID NO: 663 through SEQ ID NO: 670, SEQ ID NO:
673 through SEQ ID NO: 674, SEQ ID NO: 676, SEQ ID NO: 678 through SEQ ID NO:
683, SEQ ID NO: 687 through SEQ ID NO: 692, and SEQ ID NO: 694 through SEQ ID 
NO: 702. The invention also includes substantially pure nucleic acid encoding an H. pylori
polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO:
638 through SEQ ID NO: 640, SEQ ID NO: 642 through SEQ ID NO: 643, SEQ ID NO: 
647, SEQ ID NO: 649 through SEQ ID NO: 651, SEQ ID NO: 653 through SEQ ID NO: 
661, SEQ ID NO: 663 through SEQ ID NO: 670, SEQ ID NO: 673 through SEQ ID NO: 
674, SEQ ID NO: 676, SEQ ID NO: 678 through SEQ ID NO: 683, SEQ ID NO: 687 
through SEQ ID NO: 692, and SEQ ID NO: 694 through SEQ ID NO: 702, such nucleic
acids are contained in SEQ ID NO: 201 through SEQ ID NO: 250. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 705 through SEQ ID NO: 708, SEQ ID NO: 712 through SEQ
ID NO: 714, SEQ ID NO: 716 through SEQ ID NO: 722, SEQ ID NO: 725 through SEQ 
ID NO: 730, SEQ ID NO: 732 through SEQ ID NO: 733, SEQ ID NO: 735 through SEQ 
ID NO: 744, SEQ ID NO: 746 through SEQ ID NO: 752, SEQ ID NO: 755 through SEQ 
ID NO: 757, SEQ ID NO: 759, SEQ ID NO: 761 through SEQ ID NO: 763, and SEQ ID
NO: 767 through SEQ ID NO: 770. The invention also includes substantially pure nucleic
acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 705 through SEQ ID NO: 708, SEQ ID NO: 712 through SEQ 
ID NO: 714, SEQ ID NO: 716 through SEQ ID NO: 722, SEQ ID NO: 725 through SEQ 
ID NO: 730, SEQ ID NO: 732 through SEQ ID NO: 733, SEQ ID NO: 735 through SEQ
ID NO: 744, SEQ ID NO: 746 through SEQ ID NO: 752, SEQ ID NO: 755 through SEQ
ID NO: 757, SEQ ID NO: 759, SEQ ID NO: 761 through SEQ ID NO: 763, and SEQ ID 
NO: 767 through SEQ ID NO: 770, such nucleic acids are contained in SEQ ID NO: 251 
through SEQ ID NO: 300. 

In another aspect, the invention features a recombinant or substantially pure
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 771 through SEQ ID NO: 773, SEQ ID NO: 775, SEQ ID 
NO: 777, SEQ ID NO: 779 through SEQ ID NO: 784, SEQ ID NO: 786 through SEQ ID 
NO: 787, SEQ ID NO: 789 through SEQ ID NO: 792, SEQ ID NO: 794, SEQ ID NO: 796, 
SEQ ID NO: 798 through SEQ ID NO: 805, SEQ ID NO: 807 through SEQ ID NO: 811,
SEQ ID NO: 813 through SEQ ID NO: 819, SEQ ID NO: 821 through SEQ ID NO: 822,
SEQ ID NO: 824 through SEQ ID NO: 826, SEQ ID NO: 828 through SEQ ID NO: 832, 
and SEQ ID NO: 835. The invention also includes substantially pure nucleic acid encoding 
an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of 
SEQ ID NO: 771 through SEQ ID NO: 773, SEQ ID NO: 775, SEQ ID NO: 777, SEQ ID
NO: 779 through SEQ ID NO: 784, SEQ ID NO: 786 through SEQ ID NO: 787, SEQ ID
NO: 789 through SEQ ID NO: 792, SEQ ID NO: 794, SEQ ID NO: 796, SEQ ID NO: 798 
through SEQ ID NO: 805, SEQ ID NO: 807 through SEQ ID NO: 811, SEQ ID NO: 813 
through SEQ ID NO: 819, SEQ ID NO: 821 through SEQ ID NO: 822, SEQ ID NO: 824 
through SEQ ID NO: 826, SEQ ID NO: 828 through SEQ ID NO: 832, and SEQ ID NO:
835, such nucleic acids are contained in SEQ ID NO: 301 through SEQ ID NO: 350.

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 836 through SEQ ID NO: 841, SEQ ID NO: 843 through SEQ 
ID NO: 851, SEQ ID NO: 853, SEQ ID NO: 855 through SEQ ID NO: 857, SEQ ID NO: 
859 through SEQ ID NO: 862, SEQ ID NO: 866, SEQ ID NO: 868 through SEQ ID NO:
871, SEQ ID NO: 873 through SEQ ID NO: 876, and SEQ ID NO: 879. The invention also 
includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the 
group consisting of H. pylori polypeptides of SEQ ID NO: 836 through SEQ ID NO: 841,
SEQ ID NO: 843 through SEQ ID NO: 851, SEQ ID NO: 853, SEQ ID NO: 855 through
SEQ ID NO: 857, SEQ ID NO: 859 through SEQ ID NO: 862, SEQ ID NO: 866, SEQ ID 
NO: 868 through SEQ ID NO: 871, SEQ ID NO: 873 through SEQ ID NO: 876, and SEQ 
ID NO: 879, such nucleic acids are contained in SEQ ID NO: 351 through SEQ ID NO: 
383. 

In another aspect, the invention features a recombinant or substantially pure
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 385, SEQ ID NO: 390, SEQ ID NO: 401, SEQ ID NO: 407, 
SEQ ID NO: 409 through SEQ ID NO: 410, SEQ ID NO: 413, SEQ ID NO: 431, SEQ ID 
NO: 435, SEQ ID NO: 442, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 463 through
SEQ ID NO: 464, SEQ ID NO: 467, SEQ ID NO: 470, SEQ ID NO: 474, SEQ ID NO: 476
through SEQ ID NO: 477, SEQ ID NO: 480, SEQ ID NO: 485, SEQ ID NO: 487, SEQ ID
NO: 502, SEQ ID NO: 507 through SEQ ID NO: 508, SEQ ID NO: 511, SEQ ID NO: 515,
SEQ ID NO: 517, SEQ ID NO: 521, SEQ ID NO: 526, SEQ ID NO: 534, SEQ ID NO: 
538, SEQ ID NO: 541, SEQ ID NO: 545, SEQ ID NO: 549, SEQ ID NO: 551 through SEQ
ID NO: 552, SEQ ID NO: 557, SEQ ID NO: 559, SEQ ID NO: 561, SEQ ID NO: 569,
SEQ ID NO: 571, SEQ ID NO: 580, SEQ ID NO: 584, SEQ ID NO: 594 through SEQ ID 
NO: 595, SEQ ID NO: 615 through SEQ ID NO: 616, SEQ ID NO: 624, and SEQ ID NO: 
626. The invention also includes substantially pure nucleic acid encoding an H. pylori 
polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO:
385, SEQ ID NO: 390, SEQ ID NO: 401, SEQ ID NO: 407, SEQ ID NO: 409 through SEQ
ID NO: 410, SEQ ID NO: 413, SEQ ID NO: 431, SEQ ID NO: 435, SEQ ID NO: 442,
SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 463 through SEQ ID NO: 464, SEQ ID 
NO: 467, SEQ ID NO: 470, SEQ ID NO: 474, SEQ ID NO: 476 through SEQ ID NO: 477, 
SEQ ID NO: 480, SEQ ID NO: 485, SEQ ID NO: 487, SEQ ID NO: 502, SEQ ID NO: 507
through SEQ ID NO: 508, SEQ ID NO: 511, SEQ ID NO: 515, SEQ ID NO: 517, SEQ ID
NO: 521, SEQ ID NO: 526, SEQ ID NO: 534, SEQ ID NO: 538, SEQ ID NO: 541, SEQ 
ID NO: 545, SEQ ID NO: 549, SEQ ID NO: 551 through SEQ ID NO: 552, SEQ ID NO: 
557, SEQ ID NO: 559, SEQ ID NO: 561, SEQ ID NO: 569, SEQ ID NO: 571, SEQ ID 
NO: 580, SEQ ID NO: 584, SEQ ID NO: 594 through SEQ ID NO: 595, SEQ ID NO: 615
through SEQ ID NO: 616, SEQ ID NO: 624, and SEQ ID NO: 626, such nucleic acids are
contained in SEQ ID NO: 881 through SEQ ID NO: 930. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 628, SEQ ID NO: 632, SEQ ID NO: 634, SEQ ID NO: 637, 
SEQ ID NO: 641, SEQ ID NO: 644 through SEQ ID NO: 646, SEQ ID NO: 648, SEQ ID
NO: 652, SEQ ID NO: 662, SEQ ID NO: 671 through SEQ ID NO: 672, SEQ ID NO: 675, 
SEQ ID NO: 677, SEQ ID NO: 684 through SEQ ID NO: 686, SEQ ID NO: 693, SEQ ID 
NO: 703 through SEQ ID NO: 704, SEQ ID NO: 709 through SEQ ID NO: 711, SEQ ID
NO: 715, SEQ ID NO: 723 through SEQ ID NO: 724, SEQ ID NO: 731, SEQ ID NO: 734,
SEQ ID NO: 745, SEQ ID NO: 753 through SEQ ID NO: 754, SEQ ID NO: 758, SEQ ID 
NO: 760, SEQ ID NO: 764 through SEQ ID NO: 766, SEQ ID NO: 774, SEQ ID NO: 776, 
SEQ ID NO: 778, SEQ ID NO: 785, SEQ ID NO: 788, SEQ ID NO: 793, SEQ ID NO: 
795, SEQ ID NO: 797, SEQ ID NO: 806, SEQ ID NO: 812, SEQ ID NO: 820, SEQ ID 
NO: 823, and SEQ ID NO: 827. The invention also includes substantially pure nucleic acid
encoding an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 628, SEQ ID NO: 632, SEQ ID NO: 634, SEQ ID NO: 637, 
SEQ ID NO: 641, SEQ ID NO: 644 through SEQ ID NO: 646, SEQ ID NO: 648, SEQ ID 
NO: 652, SEQ ID NO: 662, SEQ ID NO: 671 through SEQ ID NO: 672, SEQ ID NO: 675, 
SEQ ID NO: 677, SEQ ID NO: 684 through SEQ ID NO: 686, SEQ ID NO: 693, SEQ ID
NO: 703 through SEQ ID NO: 704, SEQ ID NO: 709 through SEQ ID NO: 711, SEQ ID
NO: 715, SEQ ID NO: 723 through SEQ ID NO: 724, SEQ ID NO: 731, SEQ ID NO: 734,
SEQ ID NO: 745, SEQ ID NO: 753 through SEQ ID NO: 754, SEQ ID NO: 758, SEQ ID 
NO: 760, SEQ ID NO: 764 through SEQ ID NO: 766, SEQ ID NO: 774, SEQ ID NO: 776, 
SEQ ID NO: 778, SEQ ID NO: 785, SEQ ID NO: 788, SEQ ID NO: 793, SEQ ID NO:
795, SEQ ID NO: 797, SEQ ID NO: 806, SEQ ID NO: 812, SEQ ID NO: 820, SEQ ID 
NO: 823, and SEQ ID NO: 827, such nucleic acids are contained in SEQ ID NO: 931
through SEQ ID NO: 980. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 833 through SEQ ID NO: 834, SEQ ID NO: 842, SEQ ID 
NO: 852, SEQ ID NO: 854, SEQ ID NO: 858, SEQ ID NO: 863, SEQ ID NO: 864 through 
SEQ ID NO: 865, SEQ ID NO: 867, SEQ ID NO: 872, SEQ ID NO: 877 through SEQ ID 
NO: 878, and SEQ ID NO: 880. The invention also includes substantially pure nucleic acid 
encoding an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 833 through SEQ ID NO: 834, SEQ ID NO: 842, SEQ ID 
NO: 852, SEQ ID NO: 854, SEQ ID NO: 858, SEQ ID NO: 863, SEQ ID NO: 864 through 
SEQ ID NO: 865, SEQ ID NO: 867, SEQ ID NO: 872, SEQ ID NO: 877 through SEQ ID 
NO: 878, and SEQ ID NO: 880, such nucleic acids are contained in SEQ ID NO: 981 
through SEQ ID NO: 994.

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 1446 through SEQ ID NO: 1461, SEQ ID NO: 1463, and 
SEQ ID NO: 1465 through SEQ ID NO: 1495. The invention also includes substantially
pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of
H. pylori polypeptides SEQ ID NO: 1446 through SEQ ID NO: 1461, SEQ ID NO: 1463,
and SEQ ID NO: 1465 through SEQ ID NO: 1495, such nucleic acids are contained in SEQ 
ID NO: 995 through SEQ ID NO: 1010, SEQ ID NO: 1012, and SEQ ID NO:
1014 through SEQ ID NO: 1044.

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 1497 through SEQ ID NO: 1515, and SEQ ID NO: 1517
through SEQ ID NO: 1545. The invention also includes substantially pure nucleic acid
encoding an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides SEQ ID NO: 1497 through SEQ ID NO: 1515, and SEQ ID NO: 1517 through
SEQ ID NO: 1545, such nucleic acids are contained in SEQ ID NO: 1046 through SEQ ID
NO: 1064, and SEQ ID NO: 1066 through SEQ ID NO: 1094. 

In another aspect, the invention features a recombinant or substantially pure
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 1546 through SEQ ID NO: 1595. The invention also includes 
substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group
consisting of H. pylori polypeptides SEQ ID NO: 1546 through SEQ ID NO: 1595, such
nucleic acids are contained in SEQ ID NO: 1095 through SEQ ID NO: 1144.

In another aspect, the invention features a recombinant or substantially pure
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 1596 through SEQ ID NO: 1617, SEQ ID NO: 1620 through 
SEQ ID NO: 1645. The invention also includes substantially pure nucleic acid encoding an 
H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID
NO: 1596 through SEQ ID NO: 1617, SEQ ID NO: 1620 through SEQ ID NO: 1645, such
nucleic acids are contained in SEQ ID NO: 1145 through SEQ ID NO: 1166, and SEQ ID
NO: 1169 through SEQ ID NO: 1194.

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 1646 through SEQ ID NO: 1681, and SEQ ID NO: 1683
through SEQ ID NO: 1695. The invention also includes substantially pure nucleic acid
encoding an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides SEQ ID NO: 1646 through SEQ ID NO: 1681, and SEQ ID NO: 1683 through 
SEQ ID NO: 1695, such nucleic acids are contained in SEQ ID NO: 1195 through SEQ ID
NO: 1230, and SEQ ID NO: 1232 through SEQ ID NO: 1244.

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 1696 through SEQ ID NO: 1745. The invention also includes 
substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group 
consisting of H. pylori polypeptides SEQ ID NO: 1696 through SEQ ID NO: 1745, such
nucleic acids are contained in SEQ ID NO: 1245 through SEQ ID NO: 1294. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides of SEQ ID NO: 1746 through SEQ ID NO: 1783, and SEQ ID NO: 1786
through SEQ ID NO: 1795. The invention also includes substantially pure nucleic acid 
encoding an H. pylori polypeptide selected from the group consisting of H. pylori 
polypeptides SEQ ID NO: 1746 through SEQ ID NO: 1783, and SEQ ID NO: 1786 through 
SEQ ID NO: 1795, such nucleic acids are contained in SEQ ID NO: 1295 through SEQ ID
NO: 1332, and SEQ ID NO: 1335 through SEQ ID NO: 1344.

In another aspect, the invention features a recombinant or substantially pure
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 1796 through SEQ ID NO: 1817, SEQ ID NO: 1819, SEQ ID 
NO: 1821, SEQ ID NO: 1823 through SEQ ID NO: 1836, and SEQ ID NO: 1838 through 
SEQ ID NO: 1845. The invention also includes substantially pure nucleic acid encoding an
H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID
NO: 1796 through SEQ ID NO: 1817, SEQ ID NO: 1819, SEQ ID NO: 1821, SEQ ID NO: 
1823 through SEQ ID NO: 1836, and SEQ ID NO: 1838 through SEQ ID NO: 1845, such 
nucleic acids are contained in SEQ ID NO: 1345 through SEQ ID NO: 1366, SEQ ID NO: 
1368, SEQ ID NO: 1370, SEQ ID NO: 1372 through SEQ ID NO: 1385, and SEQ ID NO:
1387 through SEQ ID NO: 1394. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of SEQ ID NO: 1846 through SEQ ID NO: 1896. The invention also includes 
substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group
consisting of H. pylori polypeptides SEQ ID NO: 1846 through SEQ ID NO: 1896, such 
nucleic acids are contained in SEQ ID NO: 1395 through SEQ ID NO: 1445. 

In another aspect, the invention features a recombinant or substantially pure 
preparation of an H. pylori polypeptide selected from the group consisting of H. pylori
polypeptides of the invention as set forth in the Sequence Listing. The invention also
includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the 
group consisting of H. pylori polypeptides of the invention as set forth in the Sequence
Listing. It should be understood that this invention encompasses each of the H. pylori
polypeptides and nucleic acids encoding such polypeptides as identified in the Sequence 
35 Listing by a given sequence identification number. For example, a representative H. pylori 
polypeptide is contained in SEQ ID NO: 1450. Therefore, this invention encompasses a 
recombinant or substantially pure preparation of an H. pylori polypeptide of SEQ ID NO: 
1450. The invention also includes substantially pure nucleic acid encoding an H. pylori 
polypeptide of SEQ ID NO: 1450. 
In another aspect, the invention pertains to any individual H. pylori polypeptide
member or nucleic acid encoding such member from the above-identified groups of H.
pylori polypeptides (e.g., SEQ ID NO: 1546 through SEQ ID NO: 1595) or nucleic acids
(e.g., SEQ ID NO: 1095-SEQ ID NO: 1144), as well as any subgroups from within the
above-identified groups. Furthermore, the subgroups can preferably consist of 1, 3, 5, 10,
15, 20, 30 or 40 members of any of the groups identified above, as well as any
combinations thereof. For example, the group consisting of H. pylori polypeptides SEQ ID
NO: 1846 through SEQ ID NO: 1896 can be divided into one or more subgroups as
follows: SEQ ID NO: 1846-SEQ ID NO: 1860; SEQ ID NO: 1861-SEQ ID NO: 1875;
SEQ ID NO: 1876-SEQ ID NO: 1885; SEQ ID NO: 1886-SEQ ID NO: 1896; or any
combinations thereof. 
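
By way of illustration only, the example division above can be checked mechanically. The following is a minimal sketch in Python, assuming the sequence identification numbers are treated as plain integers; the helper names seq_id_range and is_partition are hypothetical and form no part of the Sequence Listing. Only the four ranges are taken from the text.

```python
# Illustrative sketch only: the helper names are hypothetical; the ranges
# are the SEQ ID NO subgroups given in the example above.

def seq_id_range(first, last):
    """Return the inclusive list of SEQ ID NO values from first to last."""
    return list(range(first, last + 1))

# The full group: SEQ ID NO: 1846 through SEQ ID NO: 1896.
GROUP = seq_id_range(1846, 1896)

# The four example subgroups named in the text.
SUBGROUPS = [
    seq_id_range(1846, 1860),
    seq_id_range(1861, 1875),
    seq_id_range(1876, 1885),
    seq_id_range(1886, 1896),
]

def is_partition(group, subgroups):
    """True if the subgroups are disjoint and together cover the group exactly."""
    flat = [n for sub in subgroups for n in sub]
    return len(flat) == len(set(flat)) and sorted(flat) == sorted(group)

print(is_partition(GROUP, SUBGROUPS))   # True
print([len(s) for s in SUBGROUPS])      # [15, 15, 10, 11]
```

Under these assumptions the four example subgroups are disjoint and together account for all 51 members of the group.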

Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence
encoding an H. pylori cell envelope polypeptide or a fragment thereof. Such nucleic acid is
selected from the group consisting of SEQ ID NO: 1020, SEQ ID NO: 1021, SEQ ID NO:
1036, SEQ ID NO: 1050, SEQ ID NO: 1071, SEQ ID NO: 1101, SEQ ID NO: 1135, SEQ
ID NO: 1276, SEQ ID NO: 1150, SEQ ID NO: 1187, SEQ ID NO: 1192, SEQ ID NO:
1361, SEQ ID NO: 1379, SEQ ID NO: 1399, SEQ ID NO: 1403, SEQ ID NO: 1400, SEQ
ID NO: 1189, SEQ ID NO: 1002, SEQ ID NO: 1213, SEQ ID NO: 1214, SEQ ID NO:
1215, SEQ ID NO: 1234, SEQ ID NO: 1236, SEQ ID NO: 1237, SEQ ID NO: 1224, SEQ
ID NO: 1251, SEQ ID NO: 1262, SEQ ID NO: 1149, SEQ ID NO: 1220, SEQ ID NO:
1240, SEQ ID NO: 1164, SEQ ID NO: 1165, SEQ ID NO: 1404, SEQ ID NO: 1144, SEQ
ID NO: 1182, SEQ ID NO: 1157, SEQ ID NO: 1160, SEQ ID NO: 1300, SEQ ID NO:
1321, SEQ ID NO: 1323, SEQ ID NO: 1329, SEQ ID NO: 1332, SEQ ID NO: 1345, SEQ
ID NO: 1358, SEQ ID NO: 1375, SEQ ID NO: 1417, SEQ ID NO: 1283, SEQ ID NO:
1335, SEQ ID NO: 1368, SEQ ID NO: 1179, SEQ ID NO: 1255, SEQ ID NO: 1258, SEQ
ID NO: 1044, SEQ ID NO: 1273, SEQ ID NO: 1219, SEQ ID NO: 1274, SEQ ID NO:
1210, SEQ ID NO: 1422, SEQ ID NO: 1302, SEQ ID NO: 1308, SEQ ID NO: 1310, SEQ
ID NO: 1331, SEQ ID NO: 1432, SEQ ID NO: 1052, SEQ ID NO: 1091, SEQ ID NO:
1421, SEQ ID NO: 1069, SEQ ID NO: 1005, SEQ ID NO: 1007, SEQ ID NO: 1166, SEQ
ID NO: 1177, SEQ ID NO: 1193, SEQ ID NO: 1206, SEQ ID NO: 1207, SEQ ID NO:
1304, SEQ ID NO: 1305, SEQ ID NO: 1346, SEQ ID NO: 1348, SEQ ID NO: 1350, SEQ
ID NO: 1032, SEQ ID NO: 1053, SEQ ID NO: 1081, SEQ ID NO: 1124, SEQ ID NO:
1382, SEQ ID NO: 1437, SEQ ID NO: 1263, SEQ ID NO: 1173, SEQ ID NO: 1405, SEQ
ID NO: 1406, SEQ ID NO: 1410, SEQ ID NO: 1086, SEQ ID NO: 1322, SEQ ID NO:
1266, SEQ ID NO: 1282, SEQ ID NO: 1271, SEQ ID NO: 1208, SEQ ID NO: 1126, SEQ
ID NO: 1270, SEQ ID NO: 1278, SEQ ID NO: 1419, SEQ ID NO: 1125, SEQ ID NO:
1181, SEQ ID NO: 1416, SEQ ID NO: 1096, SEQ ID NO: 1082, SEQ ID NO: 1146, SEQ
ID NO: 1145, SEQ ID NO: 1108, SEQ ID NO: 1148, SEQ ID NO: 1337, SEQ ID NO:
1338, SEQ ID NO: 1424, SEQ ID NO: 1000, SEQ ID NO: 1027, SEQ ID NO: 1175, SEQ
ID NO: 1330, SEQ ID NO: 217, SEQ ID NO: 217, SEQ ID NO: 367, SEQ ID NO:
911, SEQ ID NO: 944, SEQ ID NO: 18, SEQ ID NO: 107, SEQ ID NO: 894, SEQ ID NO:
943, SEQ ID NO: 203, SEQ ID NO: 85, SEQ ID NO: 290, SEQ ID NO: 5, SEQ ID NO: 
199, SEQ ID NO: 992, SEQ ID NO: 934, SEQ ID NO: 899, SEQ ID NO: 302, SEQ ID
NO: 215, SEQ ID NO: 893, SEQ ID NO: 984, SEQ ID NO: 97, SEQ ID NO: 22, SEQ ID
NO: 49, SEQ ID NO: 309, SEQ ID NO: 150, SEQ ID NO: 240, SEQ ID NO: 957, SEQ ID 
NO: 57, SEQ ID NO: 2, SEQ ID NO: 92, SEQ ID NO: 255, SEQ ID NO: 164, SEQ ID NO: 
201, SEQ ID NO: 278, SEQ ID NO: 245, SEQ ID NO: 921, SEQ ID NO: 896, SEQ ID 
NO: 248, SEQ ID NO: 159, SEQ ID NO: 979, SEQ ID NO: 194, SEQ ID NO: 194, SEQ
ID NO: 946, SEQ ID NO: 916, SEQ ID NO: 76, SEQ ID NO: 905, SEQ ID NO: 914, SEQ
ID NO: 931, SEQ ID NO: 50, SEQ ID NO: 250, SEQ ID NO: 969, SEQ ID NO: 66, SEQ
ID NO: 275, SEQ ID NO: 330, SEQ ID NO: 204, SEQ ID NO: 383, SEQ ID NO: 303, 
SEQ ID NO: 70, SEQ ID NO: 983, SEQ ID NO: 972, SEQ ID NO: 929, SEQ ID NO: 972, 
SEQ ID NO: 936, SEQ ID NO: 267, SEQ ID NO: 197, SEQ ID NO: 55, SEQ ID NO: 54,
SEQ ID NO: 210, SEQ ID NO: 90, SEQ ID NO: 15, SEQ ID NO: 913, SEQ ID NO: 227,
SEQ ID NO: 79, SEQ ID NO: 191, SEQ ID NO: 238, SEQ ID NO: 274, SEQ ID NO: 27, 
SEQ ID NO: 258, SEQ ID NO: 295, SEQ ID NO: 10, SEQ ID NO: 160, SEQ ID NO: 225, 
SEQ ID NO: 964, SEQ ID NO: 166, SEQ ID NO: 56, SEQ ID NO: 980, SEQ ID NO: 903,
SEQ ID NO: 261, SEQ ID NO: 71, SEQ ID NO: 955, SEQ ID NO: 361, SEQ ID NO: 58,
SEQ ID NO: 114, SEQ ID NO: 940, SEQ ID NO: 960, SEQ ID NO: 144, SEQ ID NO:
362, SEQ ID NO: 40, SEQ ID NO: 285, SEQ ID NO: 11, SEQ ID NO: 161, SEQ ID NO:
974, SEQ ID NO: 111, SEQ ID NO: 316, SEQ ID NO: 257, SEQ ID NO: 78, SEQ ID NO:
966, SEQ ID NO: 352, SEQ ID NO: 981, SEQ ID NO: 158, SEQ ID NO: 989, SEQ ID 
NO: 963, SEQ ID NO: 48, SEQ ID NO: 68, SEQ ID NO: 135, SEQ ID NO: 910, SEQ ID
NO: 236, SEQ ID NO: 241, SEQ ID NO: 949, SEQ ID NO: 945, SEQ ID NO: 207, SEQ
ID NO: 977, SEQ ID NO: 978, SEQ ID NO: 994, SEQ ID NO: 163, SEQ ID NO: 256, 
SEQ ID NO: 287, SEQ ID NO: 184, SEQ ID NO: 45, SEQ ID NO: 136, SEQ ID NO: 214, 
SEQ ID NO: 16, SEQ ID NO: 192, SEQ ID NO: 373, SEQ ID NO: 892, SEQ ID NO: 239, 
SEQ ID NO: 34, SEQ ID NO: 340, SEQ ID NO: 41, SEQ ID NO: 332, SEQ ID NO: 134,
and SEQ ID NO: 330.

In one embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is 
an H. pylori flagella-associated polypeptide or a fragment thereof encoded by the nucleic 
acid selected from the group consisting of SEQ ID NO: 1020, SEQ ID NO: 1021, SEQ ID
NO: 1036, SEQ ID NO: 1050, SEQ ID NO: 1071, SEQ ID NO: 1101, SEQ ID NO: 1135,
SEQ ID NO: 1276, SEQ ID NO: 1150, SEQ ID NO: 1187, SEQ ID NO: 1192, SEQ ID
NO: 1361, SEQ ID NO: 1379, SEQ ID NO: 1399, SEQ ID NO: 1403, SEQ ID NO: 1400,
SEQ ID NO: 1189, SEQ ID NO: 217, SEQ ID NO: 367, SEQ ID NO: 911, SEQ ID NO:
944, SEQ ID NO: 18, SEQ ID NO: 107, SEQ ID NO: 894, SEQ ID NO: 943, SEQ ID NO:
203, SEQ ID NO: 85, SEQ ID NO: 290, SEQ ID NO: 5, SEQ ID NO: 199, SEQ ID NO:
992, SEQ ID NO: 934, SEQ ID NO: 899, SEQ ID NO: 302, and SEQ ID NO: 215. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori inner membrane polypeptide or a fragment thereof encoded by the
nucleic acid selected from the group consisting of SEQ ID NO: 1002, SEQ ID NO: 1213,
SEQ ID NO: 1214, SEQ ID NO: 1215, SEQ ID NO: 1234, SEQ ID NO: 1236, SEQ ID
NO: 1237, SEQ ID NO: 1224, SEQ ID NO: 1251, SEQ ID NO: 1262, SEQ ID NO: 1149,
SEQ ID NO: 1220, SEQ ID NO: 1240, SEQ ID NO: 1164, SEQ ID NO: 1165, SEQ ID
NO: 1404, SEQ ID NO: 1144, SEQ ID NO: 1182, SEQ ID NO: 1157, SEQ ID NO: 1160,
SEQ ID NO: 1300, SEQ ID NO: 1321, SEQ ID NO: 1323, SEQ ID NO: 1329, SEQ ID
NO: 1332, SEQ ID NO: 1345, SEQ ID NO: 1358, SEQ ID NO: 1375, SEQ ID NO: 1417,
SEQ ID NO: 1283, SEQ ID NO: 1335, SEQ ID NO: 1368, SEQ ID NO: 1179, SEQ ID
NO: 1255, SEQ ID NO: 1258, SEQ ID NO: 1044, SEQ ID NO: 1273, SEQ ID NO: 893,
SEQ ID NO: 984, SEQ ID NO: 97, SEQ ID NO: 22, SEQ ID NO: 49, SEQ ID NO: 309,
SEQ ID NO: 150, SEQ ID NO: 240, SEQ ID NO: 957, SEQ ID NO: 57, SEQ ID NO: 2,
SEQ ID NO: 92, SEQ ID NO: 255, SEQ ID NO: 164, SEQ ID NO: 201, SEQ ID NO: 278, 
SEQ ID NO: 245, SEQ ID NO: 921, SEQ ID NO: 896, SEQ ID NO: 248, SEQ ID NO: 
159, SEQ ID NO: 979, SEQ ID NO: 194, SEQ ID NO: 194, SEQ ID NO: 946, SEQ ID 
NO: 916, SEQ ID NO: 76, SEQ ID NO: 905, SEQ ID NO: 914, SEQ ID NO: 931, SEQ ID
NO: 50, SEQ ID NO: 250, SEQ ID NO: 969, SEQ ID NO: 66, SEQ ID NO: 275, SEQ ID
NO: 330, SEQ ID NO: 204, SEQ ID NO: 383, SEQ ID NO: 303, SEQ ID NO: 70, SEQ ID 
NO: 983, SEQ ID NO: 972, SEQ ID NO: 929, SEQ ID NO: 972, SEQ ID NO: 936, SEQ 
ID NO: 267, SEQ ID NO: 197, SEQ ID NO: 55, SEQ ID NO: 54, and SEQ ID NO: 210.

In yet another embodiment, the H. pylori cell envelope polypeptide or a fragment
thereof is an H. pylori transporter polypeptide or a fragment thereof encoded by the nucleic
acid selected from the group consisting of SEQ ID NO: 1219, SEQ ID NO: 1274, SEQ ID 
NO: 1210, SEQ ID NO: 1422, SEQ ID NO: 1302, SEQ ID NO: 1308, SEQ ID NO: 1310, 
SEQ ID NO: 1331, SEQ ID NO: 1432, SEQ ID NO: 1052, SEQ ID NO: 1091, SEQ ID 
NO: 1421, SEQ ID NO: 1069, SEQ ID NO: 1005, SEQ ID NO: 1007, SEQ ID NO: 1166,
SEQ ID NO: 1177, SEQ ID NO: 1193, SEQ ID NO: 1206, SEQ ID NO: 1207, SEQ ID
NO: 1304, SEQ ID NO: 1305, SEQ ID NO: 1346, SEQ ID NO: 1348, SEQ ID NO: 1350,
SEQ ID NO: 1032, SEQ ID NO: 1053, SEQ ID NO: 1081, SEQ ID NO: 1124, SEQ ID
NO: 1382, SEQ ID NO: 1437, SEQ ID NO: 1263, SEQ ID NO: 90, SEQ ID NO: 15, SEQ
ID NO: 913, SEQ ID NO: 227, SEQ ID NO: 79, SEQ ID NO: 191, SEQ ID NO: 238, SEQ
ID NO: 274, SEQ ID NO: 27, SEQ ID NO: 258, SEQ ID NO: 295, SEQ ID NO: 10, SEQ
ID NO: 160, SEQ ID NO: 225, SEQ ID NO: 964, SEQ ID NO: 166, SEQ ID NO: 56, SEQ
ID NO: 980, SEQ ID NO: 903, SEQ ID NO: 261, SEQ ID NO: 71, SEQ ID NO: 955, SEQ
ID NO: 361, SEQ ID NO: 58, SEQ ID NO: 114, SEQ ID NO: 940, SEQ ID NO: 960, SEQ
ID NO: 144, SEQ ID NO: 362, SEQ ID NO: 40, SEQ ID NO: 285, SEQ ID NO: 11, SEQ
ID NO: 161, SEQ ID NO: 974, SEQ ID NO: 111, SEQ ID NO: 316, SEQ ID NO: 257,
SEQ ID NO: 78, and SEQ ID NO: 966. 

In yet a further embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori outer membrane polypeptide or a fragment thereof encoded by the
nucleic acid selected from the group consisting of SEQ ID NO: 1173, SEQ ID NO: 1405,
SEQ ID NO: 1406, SEQ ID NO: 1410, SEQ ID NO: 1086, SEQ ID NO: 1322, SEQ ID 
NO: 1266, SEQ ID NO: 1282, SEQ ID NO: 1271, SEQ ID NO: 1208, SEQ ID NO: 1126, 
SEQ ID NO: 1270, SEQ ID NO: 1278, SEQ ID NO: 1419, SEQ ID NO: 1125, SEQ ID
NO: 1181, SEQ ID NO: 1416, SEQ ID NO: 1096, SEQ ID NO: 1082, SEQ ID NO: 1146,
SEQ ID NO: 1145, SEQ ID NO: 1108, SEQ ID NO: 1148, SEQ ID NO: 1337, SEQ ID
NO: 1338, SEQ ID NO: 1424, SEQ ID NO: 1000, SEQ ID NO: 1027, SEQ ID NO: 1175,
SEQ ID NO: 1330, SEQ ID NO: 352, SEQ ID NO: 981, SEQ ID NO: 158, SEQ ID NO:
989, SEQ ID NO: 963, SEQ ID NO: 48, SEQ ID NO: 68, SEQ ID NO: 135, SEQ ID NO:
910, SEQ ID NO: 236, SEQ ID NO: 241, SEQ ID NO: 949, SEQ ID NO: 945, SEQ ID
NO: 207, and SEQ ID NO: 977.

Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence 
encoding an H. pylori cytoplasmic polypeptide or a fragment thereof. Such nucleic acid is
selected from the group consisting of SEQ ID NO: 1147, SEQ ID NO: 1288, SEQ ID NO:
1324, SEQ ID NO: 1363, SEQ ID NO: 997, SEQ ID NO: 1015, SEQ ID NO: 1084, SEQ
ID NO: 1094, SEQ ID NO: 1099, SEQ ID NO: 1229, SEQ ID NO: 1250, SEQ ID NO:
1268, SEQ ID NO: 1293, SEQ ID NO: 1339, SEQ ID NO: 1408, SEQ ID NO: 1429, SEQ
ID NO: 1434, SEQ ID NO: 1228, SEQ ID NO: 1031, SEQ ID NO: 1034, SEQ ID NO:
1008, SEQ ID NO: 1061, SEQ ID NO: 1064, SEQ ID NO: 1191, SEQ ID NO: 1217, SEQ
ID NO: 1365, SEQ ID NO: 1394, SEQ ID NO: 1414, SEQ ID NO: 1415, SEQ ID NO:
1435, SEQ ID NO: 1058, SEQ ID NO: 1059, SEQ ID NO: 1080, SEQ ID NO: 1128, SEQ
ID NO: 1133, SEQ ID NO: 1211, SEQ ID NO: 1252, SEQ ID NO: 1253, SEQ ID NO:
1286, SEQ ID NO: 1289, SEQ ID NO: 1291, SEQ ID NO: 1303, SEQ ID NO: 1396, SEQ
ID NO: 996, SEQ ID NO: 1095, SEQ ID NO: 1156, SEQ ID NO: 1158, SEQ ID NO: 1159,
SEQ ID NO: 1277, SEQ ID NO: 1038, SEQ ID NO: 1257, SEQ ID NO: 1357, SEQ ID
NO: 1436, SEQ ID NO: 1047, SEQ ID NO: 1055, SEQ ID NO: SEQ ID NO: 1141, SEQ
ID NO: 1227, SEQ ID NO: 1327, SEQ ID NO: 1412, SEQ ID NO: 1003, SEQ ID NO:
1087, SEQ ID NO: 1116, SEQ ID NO: 1130, SEQ ID NO: 1132, SEQ ID NO: 1185, SEQ
ID NO: 1188, SEQ ID NO: 1198, SEQ ID NO: 1218, SEQ ID NO: 1244, SEQ ID NO:
1306, SEQ ID NO: 1325, SEQ ID NO: 1397, SEQ ID NO: 1398, SEQ ID NO: 1407, SEQ
ID NO: 1433, SEQ ID NO: 1216, SEQ ID NO: 1239, SEQ ID NO: 1362, SEQ ID NO:
1017, SEQ ID NO: 1019, SEQ ID NO: 1360, SEQ ID NO: 1423, SEQ ID NO: 1425, SEQ
ID NO: 1374, SEQ ID NO: 1028, SEQ ID NO: 1037, SEQ ID NO: 1077, SEQ ID NO:
1115, SEQ ID NO: 1232, SEQ ID NO: 1241, SEQ ID NO: 1267, SEQ ID NO: 1163, SEQ
ID NO: 1068, SEQ ID NO: 1025, SEQ ID NO: 1042, SEQ ID NO: 1046, SEQ ID NO:
1056, SEQ ID NO: 1039, SEQ ID NO: 1072, SEQ ID NO: 1073, SEQ ID NO: 1092, SEQ
ID NO: 1100, SEQ ID NO: 1102, SEQ ID NO: 1103, SEQ ID NO: 1104, SEQ ID NO:
1111, SEQ ID NO: 1119, SEQ ID NO: 1136, SEQ ID NO: 1137, SEQ ID NO: 1140, SEQ
ID NO: 1142, SEQ ID NO: 1233, SEQ ID NO: 1238, SEQ ID NO: 1243, SEQ ID NO:
1245, SEQ ID NO: 1247, SEQ ID NO: 1249, SEQ ID NO: 1261, SEQ ID NO: 1269, SEQ
ID NO: 1279, SEQ ID NO: 1284, SEQ ID NO: 1290, SEQ ID NO: 1297, SEQ ID NO: 
1328, SEQ ID NO: 1370, SEQ ID NO: 1372, SEQ ID NO: 1377, SEQ ID NO: 1383, SEQ 
ID NO: 1384, SEQ ID NO: 1385, SEQ ID NO: 1388, SEQ ID NO: 1401, SEQ ID NO: 
1402, SEQ ID NO: 1418, SEQ ID NO: 1420, SEQ ID NO: 1427, SEQ ID NO: 1070, SEQ
ID NO: 1151, SEQ ID NO: 1176, SEQ ID NO: 999, SEQ ID NO: 1006, SEQ ID NO: 1012,
SEQ ID NO: 1018, SEQ ID NO: 1030, SEQ ID NO: 1033, SEQ ID NO: 1041, SEQ ID
NO: 1049, SEQ ID NO: 1054, SEQ ID NO: 1057, SEQ ID NO: 1090, SEQ ID NO: 1097,
SEQ ID NO: 1129, SEQ ID NO: 1139, SEQ ID NO: 1143, SEQ ID NO: 1152, SEQ ID
NO: 1153, SEQ ID NO: 1155, SEQ ID NO: 1161, SEQ ID NO: 1162, SEQ ID NO: 1169,
SEQ ID NO: 1170, SEQ ID NO: 1171, SEQ ID NO: 1180, SEQ ID NO: 1194, SEQ ID
NO: 1195, SEQ ID NO: 1199, SEQ ID NO: 1200, SEQ ID NO: 1201, SEQ ID NO: 1202,
SEQ ID NO: 1205, SEQ ID NO: 1312, SEQ ID NO: 1336, SEQ ID NO: 1349, SEQ ID 
NO: 1355, SEQ ID NO: 1359, SEQ ID NO: 1413, SEQ ID NO: 1426, SEQ ID NO: 1430, 
SEQ ID NO: 882, SEQ ID NO: 382, SEQ ID NO: 130, SEQ ID NO: 230, SEQ ID NO:
269, SEQ ID NO: 312, SEQ ID NO: 211, SEQ ID NO: 959, SEQ ID NO: 938, SEQ ID
NO: 110, SEQ ID NO: 244, SEQ ID NO: 328, SEQ ID NO: 235, SEQ ID NO: 315, SEQ
ID NO: 296, SEQ ID NO: 976, SEQ ID NO: 321, SEQ ID NO: 43, SEQ ID NO: 281, SEQ 
ID NO: 326, SEQ ID NO: 272, SEQ ID NO: 344, SEQ ID NO: 139, SEQ ID NO: 30, SEQ 
ID NO: 220, SEQ ID NO: 364, SEQ ID NO: 369, SEQ ID NO: 372, SEQ ID NO: 991,
SEQ ID NO: 128, SEQ ID NO: 347, SEQ ID NO: 52, SEQ ID NO: 12, SEQ ID NO: 247,
SEQ ID NO: 64, SEQ ID NO: 101, SEQ ID NO: 338, SEQ ID NO: 83, SEQ ID NO: 46,
SEQ ID NO: 348, SEQ ID NO: 223, SEQ ID NO: 39, SEQ ID NO: 232, SEQ ID NO: 168,
SEQ ID NO: 65, SEQ ID NO: 952, SEQ ID NO: 341, SEQ ID NO: 69, SEQ ID NO: 924,
SEQ ID NO: 4, SEQ ID NO: 197, SEQ ID NO: 313, SEQ ID NO: 119, SEQ ID NO: 188,
SEQ ID NO: 956, SEQ ID NO: 935, SEQ ID NO: 246, SEQ ID NO: 196, SEQ ID NO:
376, SEQ ID NO: 172, SEQ ID NO: 25, SEQ ID NO: 126, SEQ ID NO: 951, SEQ ID NO: 
147, SEQ ID NO: 895, SEQ ID NO: 14, SEQ ID NO: 154, SEQ ID NO: 277, SEQ ID NO: 
363, SEQ ID NO: 342, SEQ ID NO: 378, SEQ ID NO: 130, SEQ ID NO: 198, SEQ ID
NO: 243, SEQ ID NO: 19, SEQ ID NO: 9, SEQ ID NO: 149, SEQ ID NO: 167, SEQ ID
NO: 349, SEQ ID NO: 209, SEQ ID NO: 990, SEQ ID NO: 185, SEQ ID NO: 883, SEQ
ID NO: 8, SEQ ID NO: 887, SEQ ID NO: 350, SEQ ID NO: 987, SEQ ID NO: 63, SEQ ID
NO: 249, SEQ ID NO: 118, SEQ ID NO: 132, SEQ ID NO: 47, SEQ ID NO: 106, SEQ ID
NO: 324, SEQ ID NO: 155, SEQ ID NO: 121, SEQ ID NO: 153, SEQ ID NO: 87, SEQ ID
NO: 986, SEQ ID NO: 262, SEQ ID NO: 333, SEQ ID NO: 36, SEQ ID NO: 982, SEQ ID
NO: 180, SEQ ID NO: 84, SEQ ID NO: 900, SEQ ID NO: 20, SEQ ID NO: 7, SEQ ID NO:
61, SEQ ID NO: 253, SEQ ID NO: 120, SEQ ID NO: 268, SEQ ID NO: 299, SEQ ID NO: 
942, SEQ ID NO: 173, SEQ ID NO: 187, SEQ ID NO: 187, SEQ ID NO: 234, SEQ ID 
NO: 112, SEQ ID NO: 324, SEQ ID NO: 971, SEQ ID NO: 62, SEQ ID NO: 308, SEQ ID
NO: 74, SEQ ID NO: 1, SEQ ID NO: 266, SEQ ID NO: 337, SEQ ID NO: 93, SEQ ID NO:
44, SEQ ID NO: 335, SEQ ID NO: 368, SEQ ID NO: 208, SEQ ID NO: 358, SEQ ID NO:
923, SEQ ID NO: 310, SEQ ID NO: 26, SEQ ID NO: 279, SEQ ID NO: 890, SEQ ID NO:
325, SEQ ID NO: 109, SEQ ID NO: 143, SEQ ID NO: 918, SEQ ID NO: 252, SEQ ID
NO: 953, SEQ ID NO: 902, SEQ ID NO: 174, SEQ ID NO: 73, SEQ ID NO: 898, SEQ ID
NO: 300, SEQ ID NO: 356, SEQ ID NO: 298, SEQ ID NO: 354, SEQ ID NO: 138, SEQ
ID NO: 319, SEQ ID NO: 80, SEQ ID NO: 933, SEQ ID NO: 891, SEQ ID NO: 366, SEQ
ID NO: 113, SEQ ID NO: 320, SEQ ID NO: 915, SEQ ID NO: 351, SEQ ID NO: 162,
SEQ ID NO: 965, SEQ ID NO: 67, SEQ ID NO: 314, SEQ ID NO: 904, SEQ ID NO: 345,
SEQ ID NO: 374, SEQ ID NO: 962, SEQ ID NO: 270, SEQ ID NO: 186, SEQ ID NO: 60,
SEQ ID NO: 379, SEQ ID NO: 889, SEQ ID NO: 967, SEQ ID NO: 973, SEQ ID NO:
280, SEQ ID NO: 170, SEQ ID NO: 985, and SEQ ID NO: 932.

In one embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is 
an H. pylori polypeptide or a fragment thereof involved in energy conversion encoded by 
the nucleic acid selected from the group consisting of SEQ ID NO: 1147, SEQ ID NO:
1288, SEQ ID NO: 1324, SEQ ID NO: 1363, SEQ ID NO: 882, SEQ ID NO: 382, SEQ ID
NO: 130, and SEQ ID NO: 230. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in amino acid metabolism 
encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 997, SEQ 
ID NO: 1015, SEQ ID NO: 1084, SEQ ID NO: 1094, SEQ ID NO: 1099, SEQ ID NO:
1229, SEQ ID NO: 1250, SEQ ID NO: 1268, SEQ ID NO: 1293, SEQ ID NO: 1339, SEQ 
ID NO: 1408, SEQ ID NO: 1429, SEQ ID NO: 1434, SEQ ID NO: 1228, SEQ ID NO: 
1031, SEQ ID NO: 1034, SEQ ID NO: 1008, SEQ ID NO: 269, SEQ ID NO: 312, SEQ ID 
NO: 211, SEQ ID NO: 959, SEQ ID NO: 938, SEQ ID NO: 110, SEQ ID NO: 244, SEQ
ID NO: 328, SEQ ID NO: 235, SEQ ID NO: 315, SEQ ID NO: 296, SEQ ID NO: 976,
SEQ ID NO: 321, SEQ ID NO: 43, SEQ ID NO: 281, SEQ ID NO: 326, and SEQ ID NO: 
272. 

In yet another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof involved in nucleotide metabolism 
encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 1061, SEQ
ID NO: 1064, SEQ ID NO: 1191, SEQ ID NO: 1217, SEQ ID NO: 1365, SEQ ID NO:
1394, SEQ ID NO: 1414, SEQ ID NO: 1415, SEQ ID NO: 1435, SEQ ID NO: 1058, SEQ 
ID NO: 1059, SEQ ID NO: 344, SEQ ID NO: 139, SEQ ID NO: 30, SEQ ID NO: 220, 
SEQ ID NO: 364, SEQ ID NO: 369, SEQ ID NO: 372, SEQ ID NO: 991, SEQ ID NO:
128, SEQ ID NO: 347, and SEQ ID NO: 52. 

In yet a further embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof involved in carbohydrate
metabolism encoded by the nucleic acid selected from the group consisting of SEQ ID NO:
1080, SEQ ID NO: 1128, SEQ ID NO: 1133, SEQ ID NO: 1211, SEQ ID NO: 1252, SEQ
ID NO: 1253, SEQ ID NO: 1286, SEQ ID NO: 1289, SEQ ID NO: 1291, SEQ ID NO: 
1303, SEQ ID NO: 1396, SEQ ID NO: 996, SEQ ID NO: 12, SEQ ID NO: 247, SEQ ID 
NO: 64, SEQ ID NO: 101, SEQ ID NO: 338, SEQ ID NO: 83, SEQ ID NO: 46, SEQ ID
NO: 348, SEQ ID NO: 223, SEQ ID NO: 39, SEQ ID NO: 232, and SEQ ID NO: 168.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in cofactor metabolism encoded 
by the nucleic acid selected from the group consisting of SEQ ID NO: 1095, SEQ ID NO: 
1156, SEQ ID NO: 1158, SEQ ID NO: 1159, SEQ ID NO: 1277, SEQ ID NO: 1038, SEQ
ID NO: 65, SEQ ID NO: 952, SEQ ID NO: 341, SEQ ID NO: 69, SEQ ID NO: 924, and
SEQ ID NO: 4. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in lipid metabolism encoded by
the nucleic acid selected from the group consisting of SEQ ID NO: 1257, SEQ ID NO:
1357, SEQ ID NO: 1436, SEQ ID NO: 197, SEQ ID NO: 313, and SEQ ID NO: 119.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in mRNA translation and 
ribosome biogenesis encoded by the nucleic acid selected from the group consisting of SEQ 
ID NO: 1047, SEQ ID NO: 1055, SEQ ID NO: SEQ ID NO: 1141, SEQ ID NO: 1227,
SEQ ID NO: 1327, SEQ ID NO: 1412, SEQ ID NO: 188, SEQ ID NO: 956, SEQ ID NO:
935, SEQ ID NO: 246, SEQ ID NO: 196, and SEQ ID NO: 376. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in genome replication, 
transcription, recombination and repair encoded by the nucleic acid selected from the group
consisting of SEQ ID NO: 1003, SEQ ID NO: 1087, SEQ ID NO: 1116, SEQ ID NO: 1130,
SEQ ID NO: 1132, SEQ ID NO: 1185, SEQ ID NO: 1188, SEQ ID NO: 1198, SEQ ID
NO: 1218, SEQ ID NO: 1244, SEQ ID NO: 1306, SEQ ID NO: 1325, SEQ ID NO: 1397,
SEQ ID NO: 1398, SEQ ID NO: 1407, SEQ ID NO: 1433, SEQ ID NO: 172, SEQ ID NO:
25, SEQ ID NO: 126, SEQ ID NO: 951, SEQ ID NO: 147, SEQ ID NO: 895, SEQ ID NO:
14, SEQ ID NO: 154, SEQ ID NO: 277, SEQ ID NO: 363, SEQ ID NO: 342, SEQ ID NO:
378, SEQ ID NO: 130, SEQ ID NO: 198, SEQ ID NO: 243, SEQ ID NO: 19, and SEQ ID 
NO: 9. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in outer membrane or cell wall 
biosynthesis encoded by the nucleic acid selected from the group consisting of SEQ ID NO:
1216, SEQ ID NO: 1239, SEQ ID NO: 1362, SEQ ID NO: 1017, SEQ ID NO: 1019, SEQ 
ID NO: 1360, SEQ ID NO: 149, SEQ ID NO: 167, SEQ ID NO: 349, SEQ ID NO: 209, 
SEQ ID NO: 990, SEQ ID NO: 185, SEQ ID NO: 883, and SEQ ID NO: 8.

In yet another embodiment, the H. pylori cytoplasmic polypeptide is an H. pylori
chaperone polypeptide or a fragment thereof encoded by the nucleic acid selected from the
group consisting of SEQ ID NO: 1423, SEQ ID NO: 1425, SEQ ID NO: 1374, SEQ ID 
NO: 887, SEQ ID NO: 350, and SEQ ID NO: 987. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence
encoding an H. pylori secreted or periplasmic polypeptide or a fragment thereof. Such
nucleic acid is selected from the group consisting of SEQ ID NO: 1004, SEQ ID NO: 1138,
SEQ ID NO: 1067, SEQ ID NO: 1078, SEQ ID NO: 1314, SEQ ID NO: 1319, SEQ ID
NO: 1378, SEQ ID NO: 1105, SEQ ID NO: 1114, SEQ ID NO: 1118, SEQ ID NO: 1120,
SEQ ID NO: 1123, SEQ ID NO: 1127, SEQ ID NO: 1212, SEQ ID NO: 1223, SEQ ID
NO: 1225, SEQ ID NO: 1246, SEQ ID NO: 1248, SEQ ID NO: 1259, SEQ ID NO: 1264,
SEQ ID NO: 1265, SEQ ID NO: 1281, SEQ ID NO: 1285, SEQ ID NO: 1294, SEQ ID
NO: 1298, SEQ ID NO: 1299, SEQ ID NO: 1315, SEQ ID NO: 1316, SEQ ID NO: 1317, 
SEQ ID NO: 1318, SEQ ID NO: 1344, SEQ ID NO: 1351, SEQ ID NO: 1353, SEQ ID
NO: 1373, SEQ ID NO: 1380, SEQ ID NO: 1387, SEQ ID NO: 1389, SEQ ID NO: 1393,
SEQ ID NO: 1411, SEQ ID NO: 1428, SEQ ID NO: 1431, SEQ ID NO: 1439, SEQ ID
NO: 1043, SEQ ID NO: 1183, SEQ ID NO: 1184, SEQ ID NO: 1196, SEQ ID NO: 1197,
SEQ ID NO: 1203, SEQ ID NO: 995, SEQ ID NO: 998, SEQ ID NO: 1001, SEQ ID NO:
1022, SEQ ID NO: 1023, SEQ ID NO: 1029, SEQ ID NO: 1040, SEQ ID NO: 1051, SEQ 
ID NO: 1062, SEQ ID NO: 1154, SEQ ID NO: 1320, SEQ ID NO: 1075, SEQ ID NO:
1106, SEQ ID NO: 1109, SEQ ID NO: 1134, SEQ ID NO: 1221, SEQ ID NO: 1226, SEQ
ID NO: 1235, SEQ ID NO: 1301, SEQ ID NO: 1311, SEQ ID NO: 1326, SEQ ID NO:
1341, SEQ ID NO: 1354, SEQ ID NO: 1364, SEQ ID NO: 1366, SEQ ID NO: 1376, SEQ
ID NO: 1391, SEQ ID NO: 1395, SEQ ID NO: 1445, SEQ ID NO: 1079, SEQ ID NO:
1186, SEQ ID NO: 1010, SEQ ID NO: 1016, SEQ ID NO: 1172, SEQ ID NO: 1174, SEQ
ID NO: 117, SEQ ID NO: 254, SEQ ID NO: 24, SEQ ID NO: 242, SEQ ID NO: 950, SEQ
ID NO: 263, SEQ ID NO: 286, SEQ ID NO: 947, SEQ ID NO: 51, SEQ ID NO: 177, SEQ 
ID NO: 156, SEQ ID NO: 190, SEQ ID NO: 375, SEQ ID NO: 222, SEQ ID NO: 21, SEQ
ID NO: 912, SEQ ID NO: 148, SEQ ID NO: 202, SEQ ID NO: 224, SEQ ID NO: 112,
SEQ ID NO: 32, SEQ ID NO: 339, SEQ ID NO: 182, SEQ ID NO: 228, SEQ ID NO: 152,
SEQ ID NO: 219, SEQ ID NO: 137, SEQ ID NO: 318, SEQ ID NO: 141, SEQ ID NO:
165, SEQ ID NO: 334, SEQ ID NO: 13, SEQ ID NO: 297, SEQ ID NO: 35, SEQ ID NO: 
216, SEQ ID NO: 908, SEQ ID NO: 124, SEQ ID NO: 75, SEQ ID NO: 927, SEQ ID NO: 
221, SEQ ID NO: 178, SEQ ID NO: 169, SEQ ID NO: 293, SEQ ID NO: 289, SEQ ID 
NO: 926, SEQ ID NO: 948, SEQ ID NO: 115, SEQ ID NO: 251, SEQ ID NO: 345, SEQ
ID NO: 17, SEQ ID NO: 920, SEQ ID NO: 95, SEQ ID NO: 86, SEQ ID NO: 360, SEQ ID
NO: 271, SEQ ID NO: 970, SEQ ID NO: 288, SEQ ID NO: 282, SEQ ID NO: 98, SEQ ID 
NO: 29, SEQ ID NO: 317, SEQ ID NO: 343, SEQ ID NO: 291, SEQ ID NO: 108, SEQ ID 
NO: 377, SEQ ID NO: 305, SEQ ID NO: 305, SEQ ID NO: 100, SEQ ID NO: 988, SEQ
ID NO: 212, SEQ ID NO: 884, SEQ ID NO: 37, SEQ ID NO: 968, SEQ ID NO: 975, SEQ
ID NO: 237, SEQ ID NO: 335, SEQ ID NO: 260, SEQ ID NO: 370, SEQ ID NO: 91, SEQ 
ID NO: 276, SEQ ID NO: 311, SEQ ID NO: 173, SEQ ID NO: 102, SEQ ID NO: 304, 
SEQ ID NO: 380, SEQ ID NO: 127, SEQ ID NO: 993, SEQ ID NO: 925, SEQ ID NO: 
181, and SEQ ID NO: 171. 

Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence
encoding an H. pylori surface or membrane polypeptide or a fragment thereof. Such 
nucleic acid is selected from the group consisting of SEQ ID NO: 1060, SEQ ID NO: 1110, 
SEQ ID NO: 1112, SEQ ID NO: 1230, SEQ ID NO: 1260, SEQ ID NO: 1280, SEQ ID
NO: 1292, SEQ ID NO: 1296, SEQ ID NO: 1307, SEQ ID NO: 1442, SEQ ID NO: 1444,
SEQ ID NO: 1122, SEQ ID NO: 1254, SEQ ID NO: 1256, SEQ ID NO: 1272, SEQ ID
NO: 1275, SEQ ID NO: 1309, SEQ ID NO: 1313, SEQ ID NO: 1347, SEQ ID NO: 1352, 
SEQ ID NO: 1356, SEQ ID NO: 1438, SEQ ID NO: 1441, SEQ ID NO: 1009, SEQ ID
NO: 1026, SEQ ID NO: 1048, SEQ ID NO: 1063, SEQ ID NO: 1190, SEQ ID NO: 1083,
SEQ ID NO: 1113, SEQ ID NO: 1222, SEQ ID NO: 1295, SEQ ID NO: 1343, SEQ ID
NO: 1392, SEQ ID NO: 1443, SEQ ID NO: 1085, SEQ ID NO: 1093, SEQ ID NO: 1117,
SEQ ID NO: 1121, SEQ ID NO: 1131, SEQ ID NO: 1287, SEQ ID NO: 1440, SEQ ID
NO: 1209, SEQ ID NO: 1342, SEQ ID NO: 1381, SEQ ID NO: 1390, SEQ ID NO: 1409, 
SEQ ID NO: 1035, SEQ ID NO: 1014, SEQ ID NO: 1088, SEQ ID NO: 1242, SEQ ID
NO: 1178, SEQ ID NO: 1089, SEQ ID NO: 1340, SEQ ID NO: 1074, SEQ ID NO: 1107,
SEQ ID NO: 1204, SEQ ID NO: 1066, SEQ ID NO: 381, SEQ ID NO: 229, SEQ ID NO:
323, SEQ ID NO: 371, SEQ ID NO: 284, SEQ ID NO: 116, SEQ ID NO: 3, SEQ ID NO:
6, SEQ ID NO: 907, SEQ ID NO: 193, SEQ ID NO: 145, SEQ ID NO: 59, SEQ ID NO:
322, SEQ ID NO: 94, SEQ ID NO: 306, SEQ ID NO: 939, SEQ ID NO: 205, SEQ ID NO:
123, SEQ ID NO: 906, SEQ ID NO: 928, SEQ ID NO: 346, SEQ ID NO: 129, SEQ ID
NO: 307, SEQ ID NO: 133, SEQ ID NO: 131, SEQ ID NO: 886, SEQ ID NO: 179, SEQ
ID NO: 104, SEQ ID NO: 213, SEQ ID NO: 359, SEQ ID NO: 140, SEQ ID NO: 146,
SEQ ID NO: 327, SEQ ID NO: 365, SEQ ID NO: 33, SEQ ID NO: 331, SEQ ID NO: 175,
SEQ ID NO: 200, SEQ ID NO: 292, SEQ ID NO: 23, SEQ ID NO: 336, SEQ ID NO: 301,
SEQ ID NO: 28, SEQ ID NO: 941, SEQ ID NO: 103, SEQ ID NO: 231, SEQ ID NO: 176,
SEQ ID NO: 31, SEQ ID NO: 917, SEQ ID NO: 151, SEQ ID NO: 922, SEQ ID NO: 265,
SEQ ID NO: 142, SEQ ID NO: 259, SEQ ID NO: 122, SEQ ID NO: 206, SEQ ID NO: 96, 
SEQ ID NO: 353, SEQ ID NO: 38, SEQ ID NO: 89, SEQ ID NO: 77, SEQ ID NO: 954, 
SEQ ID NO: 264, SEQ ID NO: 937, SEQ ID NO: 226, SEQ ID NO: 283, SEQ ID NO: 88, 
SEQ ID NO: 125, SEQ ID NO: 183, SEQ ID NO: 195, SEQ ID NO: 81, SEQ ID NO: 901,
SEQ ID NO: 82, SEQ ID NO: 42, SEQ ID NO: 881, and SEQ ID NO: 885. 

In one embodiment, the H. pylori surface or membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof having at least one membrane 
spanning region encoded by the nucleic acid selected from the group consisting of SEQ ID
NO: 1060, SEQ ID NO: 1110, SEQ ID NO: 1112, SEQ ID NO: 1230, SEQ ID NO: 1260, 
SEQ ID NO: 1280, SEQ ID NO: 1292, SEQ ID NO: 1296, SEQ ID NO: 1307, SEQ ID 
NO: 1442, SEQ ID NO: 1444, SEQ ID NO: 381, SEQ ID NO: 229, SEQ ID NO: 323, SEQ
ID NO: 371, SEQ ID NO: 284, SEQ ID NO: 116, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID
NO: 907, SEQ ID NO: 193, SEQ ID NO: 145, SEQ ID NO: 59, SEQ ID NO: 322, SEQ ID
NO: 94, SEQ ID NO: 306, and SEQ ID NO: 881. 

In another embodiment, the H. pylori surface or membrane polypeptide or a 
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least two
membrane spanning regions encoded by the nucleic acid selected from the group consisting
of SEQ ID NO: 1122, SEQ ID NO: 1254, SEQ ID NO: 1256, SEQ ID NO: 1272, SEQ ID
NO: 1275, SEQ ID NO: 1309, SEQ ID NO: 1313, SEQ ID NO: 1347, SEQ ID NO: 1352,
SEQ ID NO: 1356, SEQ ID NO: 1438, SEQ ID NO: 1441, SEQ ID NO: 1009, SEQ ID 
NO: 1026, SEQ ID NO: 1048, SEQ ID NO: 1063, SEQ ID NO: 1190, SEQ ID NO: 939,
SEQ ID NO: 205, SEQ ID NO: 123, SEQ ID NO: 906, SEQ ID NO: 928, SEQ ID NO:
346, SEQ ID NO: 129, SEQ ID NO: 307, SEQ ID NO: 133, SEQ ID NO: 131, SEQ ID
NO: 886, SEQ ID NO: 179, SEQ ID NO: 104, SEQ ID NO: 213, SEQ ID NO: 359, SEQ
ID NO: 140, SEQ ID NO: 146, SEQ ID NO: 327, and SEQ ID NO: 365. 

In yet another embodiment, the H. pylori surface or membrane polypeptide or a 
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least three
membrane spanning regions encoded by the nucleic acid selected from the group consisting
of SEQ ID NO: 1083, SEQ ID NO: 1113, SEQ ID NO: 1222, SEQ ID NO: 1295, SEQ ID 
NO: 1343, SEQ ID NO: 1392, SEQ ID NO: 1443, SEQ ID NO: 33, SEQ ID NO: 331, SEQ 
ID NO: 175, SEQ ID NO: 200, SEQ ID NO: 292, SEQ ID NO: 23, and SEQ ID NO: 336. 

In yet a further embodiment, the H. pylori surface or membrane polypeptide or a
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least four
membrane spanning regions encoded by the nucleic acid selected from the group consisting
of SEQ ID NO: 1085, SEQ ID NO: 1093, SEQ ID NO: 1117, SEQ ID NO: 1121, SEQ ID
NO: 1131, SEQ ID NO: 1287, SEQ ID NO: 1440, SEQ ID NO: 1209, SEQ ID NO: 301,
SEQ ID NO: 28, SEQ ID NO: 941, SEQ ID NO: 103, SEQ ID NO: 231, SEQ ID NO: 176,
SEQ ID NO: 31, SEQ ID NO: 917, SEQ ID NO: 151, and SEQ ID NO: 922.

In another embodiment, the H. pylori surface or membrane polypeptide or a 
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least five 
membrane spanning regions encoded by the nucleic acid selected from the group consisting 
of SEQ ID NO: 1342, SEQ ID NO: 1381, SEQ ID NO: 1390, SEQ ID NO: 1409, SEQ ID 
NO: 1035, SEQ ID NO: 265, SEQ ID NO: 142, SEQ ID NO: 259, SEQ ID NO: 122, SEQ
ID NO: 206, and SEQ ID NO: 885. 

In another embodiment, the H. pylori surface or membrane polypeptide or a 
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least six
membrane spanning regions encoded by the nucleic acid selected from the group consisting
of SEQ ID NO: 1014, SEQ ID NO: 1088, SEQ ID NO: 1242, SEQ ID NO: 1178, SEQ ID
NO: 96, SEQ ID NO: 353, SEQ ID NO: 38, SEQ ID NO: 89, SEQ ID NO: 77, SEQ ID NO:
954, and SEQ ID NO: 264.

In another embodiment, the H. pylori surface or membrane polypeptide or a
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least seven
membrane spanning regions encoded by the nucleic acid selected from the group consisting 
of SEQ ID NO: 1089, SEQ ID NO: 1340, SEQ ID NO: 1074, SEQ ID NO: 1107, SEQ ID
NO: 1204, SEQ ID NO: 1066, SEQ ID NO: 937, SEQ ID NO: 226, SEQ ID NO: 283, SEQ
ID NO: 88, SEQ ID NO: 125, SEQ ID NO: 183, SEQ ID NO: 195, SEQ ID NO: 81, SEQ
ID NO: 901, SEQ ID NO: 82, and SEQ ID NO: 42.

Particularly preferred is a purified or isolated H. pylori cell envelope polypeptide or 
a fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ 
ID NO: 1471, SEQ ID NO: 1472, SEQ ID NO: 1487, SEQ ID NO: 1501, SEQ ID NO:
1522, SEQ ID NO: 1552, SEQ ID NO: 1586, SEQ ID NO: 1727, SEQ ID NO: 1601, SEQ
ID NO: 1638, SEQ ID NO: 1643, SEQ ID NO: 1812, SEQ ID NO: 1830, SEQ ID NO:
1850, SEQ ID NO: 1854, SEQ ID NO: 1851, SEQ ID NO: 1640, SEQ ID NO: 1453, SEQ 
ID NO: 1664, SEQ ID NO: 1665, SEQ ID NO: 1666, SEQ ID NO: 1685, SEQ ID NO: 
1687, SEQ ID NO: 1688, SEQ ID NO: 1675, SEQ ID NO: 1702, SEQ ID NO: 1713, SEQ 
ID NO: 1600, SEQ ID NO: 1671, SEQ ID NO: 1691, SEQ ID NO: 1615, SEQ ID NO:
1616, SEQ ID NO: 1855, SEQ ID NO: 1595, SEQ ID NO: 1633, SEQ ID NO: 1608, SEQ
ID NO: 1611, SEQ ID NO: 1751, SEQ ID NO: 1772, SEQ ID NO: 1774, SEQ ID NO:
1780, SEQ ID NO: 1783, SEQ ID NO: 1796, SEQ ID NO: 1809, SEQ ID NO: 1826, SEQ
ID NO: 1868, SEQ ID NO: 1734, SEQ ID NO: 1786, SEQ ID NO: 1819, SEQ ID NO:
1630, SEQ ID NO: 1706, SEQ ID NO: 1709, SEQ ID NO: 1495, SEQ ID NO: 1724, SEQ
ID NO: 1670, SEQ ID NO: 1725, SEQ ID NO: 1661, SEQ ID NO: 1873, SEQ ID NO:
1753, SEQ ID NO: 1759, SEQ ID NO: 1761, SEQ ID NO: 1782, SEQ ID NO: 1883, SEQ
ID NO: 1503, SEQ ID NO: 1542, SEQ ID NO: 1872, SEQ ID NO: 1520, SEQ ID NO:
1456, SEQ ID NO: 1458, SEQ ID NO: 1617, SEQ ID NO: 1628, SEQ ID NO: 1644, SEQ
ID NO: 1657, SEQ ID NO: 1658, SEQ ID NO: 1755, SEQ ID NO: 1756, SEQ ID NO:
1797, SEQ ID NO: 1799, SEQ ID NO: 1801, SEQ ID NO: 1483, SEQ ID NO: 1504, SEQ
ID NO: 1532, SEQ ID NO: 1575, SEQ ID NO: 1833, SEQ ID NO: 1888, SEQ ID NO:
1714, SEQ ID NO: 1624, SEQ ID NO: 1856, SEQ ID NO: 1857, SEQ ID NO: 1861, SEQ
ID NO: 1537, SEQ ID NO: 1773, SEQ ID NO: 1717, SEQ ID NO: 1733, SEQ ID NO: 
1722, SEQ ID NO: 1659, SEQ ID NO: 1577, SEQ ID NO: 1721, SEQ ID NO: 1729, SEQ 
ID NO: 1870, SEQ ID NO: 1576, SEQ ID NO: 1632, SEQ ID NO: 1867, SEQ ID NO:
1547, SEQ ID NO: 1533, SEQ ID NO: 1597, SEQ ID NO: 1596, SEQ ID NO: 1559, SEQ 
ID NO: 1599, SEQ ID NO: 1788, SEQ ID NO: 1789, SEQ ID NO: 1875, SEQ ID NO: 
1451, SEQ ID NO: 1478, SEQ ID NO: 1626, SEQ ID NO: 1781, SEQ ID NO: 660, SEQ
ID NO: 660, SEQ ID NO: 855, SEQ ID NO: 534, SEQ ID NO: 675, SEQ ID NO: 404,
SEQ ID NO: 518, SEQ ID NO: 464, SEQ ID NO: 672, SEQ ID NO: 640, SEQ ID NO: 
490, SEQ ID NO: 755, SEQ ID NO: 389, SEQ ID NO: 635, SEQ ID NO: 877, SEQ ID 
NO: 637, SEQ ID NO: 477, SEQ ID NO: 772, SEQ ID NO: 658, SEQ ID NO: 463, SEQ 
ID NO: 852, SEQ ID NO: 503, SEQ ID NO: 411, SEQ ID NO: 441, SEQ ID NO: 782,
SEQ ID NO: 575, SEQ ID NO: 691, SEQ ID NO: 724, SEQ ID NO: 452, SEQ ID NO:
386, SEQ ID NO: 497, SEQ ID NO: 712, SEQ ID NO: 591, SEQ ID NO: 638, SEQ ID 
NO: 740, SEQ ID NO: 697, SEQ ID NO: 569, SEQ ID NO: 470, SEQ ID NO: 700, SEQ 
ID NO: 586, SEQ ID NO: 823, SEQ ID NO: 627, SEQ ID NO: 627, SEQ ID NO: 684, 
SEQ ID NO: 551, SEQ ID NO: 478, SEQ ID NO: 508, SEQ ID NO: 545, SEQ ID NO:
628, SEQ ID NO: 443, SEQ ID NO: 702, SEQ ID NO: 776, SEQ ID NO: 461, SEQ ID
NO: 737, SEQ ID NO: 809, SEQ ID NO: £42, SEQ ID NO: 879, SEQ ID NO: 773, SEQ 
ID NO: 468, SEQ ID NO: 842, SEQ ID NO: 788, SEQ ID NO: 624, SEQ ID NO: 788, 
SEQ ID NO: 644, SEQ ID NO: 727, SEQ ID NO: 631, SEQ ID NO: 450, SEQ ID NO: 
448, SEQ ID NO: 653, SEQ ID NO: 495, SEQ ID NO: 400, SEQ ID NO: 541, SEQ ID
NO: 673, SEQ ID NO: 482, SEQ ID NO: 622, SEQ ID NO: 689, SEQ ID NO: 736, SEQ
ID NO: 417, SEQ ID NO: 716, SEQ ID NO: 762, SEQ ID NO: 395, SEQ ID NO: 587, 
SEQ ID NO: 669, SEQ ID NO: 758, SEQ ID NO: 593, SEQ ID NO: 451, SEQ ID NO:
827, SEQ ID NO: 502, SEQ ID NO: 719, SEQ ID NO: 469, SEQ ID NO: 715, SEQ ID 
NO: 847, SEQ ID NO: 453, SEQ ID NO: 527, SEQ ID NO: 652, SEQ ID NO: 745, SEQ
ID NO: 567, SEQ ID NO: 848, SEQ ID NO: 430, SEQ ID NO: 748, SEQ ID NO: 396,
SEQ ID NO: 588, SEQ ID NO: 795, SEQ ID NO: 523, SEQ ID NO: 791, SEQ ID NO: 
714, SEQ ID NO: 481, SEQ ID NO: 765, SEQ ID NO: 837, SEQ ID NO: 833, SEQ ID 
NO: 585, SEQ ID NO: 865, SEQ ID NO: 764, SEQ ID NO: 440, SEQ ID NO: 465, SEQ 
ID NO: 555, SEQ ID NO: 526, SEQ ID NO: 687, SEQ ID NO: 692, SEQ ID NO: 693,
SEQ ID NO: 677, SEQ ID NO: 649, SEQ ID NO: 812, SEQ ID NO: 820, SEQ ID NO:
880, SEQ ID NO: 590, SEQ ID NO: 713, SEQ ID NO: 750, SEQ ID NO: 613, SEQ ID NO:
437, SEQ ID NO: 556, SEQ ID NO: 657, SEQ ID NO: 402, SEQ ID NO: 623, SEQ ID 
NO: 862, SEQ ID NO: 449, SEQ ID NO: 690, SEQ ID NO: 424, SEQ ID NO: 821, SEQ 
ID NO: 432, SEQ ID NO: 811, SEQ ID NO: 554, and SEQ ID NO: 809.

In one embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is
an H. pylori flagella-associated polypeptide or a fragment thereof selected from the group
consisting of SEQ ID NO: 1471, SEQ ID NO: 1472, SEQ ID NO: 1487, SEQ ID NO: 1501, 
SEQ ID NO: 1522, SEQ ID NO: 1552, SEQ ID NO: 1586, SEQ ID NO: 1727, SEQ ID 
NO: 1601, SEQ ID NO: 1638, SEQ ID NO: 1643, SEQ ID NO: 1812, SEQ ID NO: 1830,
SEQ ID NO: 1850, SEQ ID NO: 1854, SEQ ID NO: 1851, SEQ ID NO: 1640, SEQ ID
NO: 660, SEQ ID NO: 660, SEQ ID NO: 855, SEQ ID NO: 534, SEQ ID NO: 675, SEQ
ID NO: 404, SEQ ID NO: 518, SEQ ID NO: 464, SEQ ID NO: 672, SEQ ID NO: 640, 
SEQ ID NO: 490, SEQ ID NO: 755, SEQ ID NO: 389, SEQ ID NO: 635, SEQ ID NO:
877, SEQ ID NO: 637, SEQ ID NO: 477, SEQ ID NO: 772, and SEQ ID NO: 658.

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori inner membrane polypeptide or a fragment thereof selected from the 
group consisting of SEQ ID NO: 1453, SEQ ID NO: 1664, SEQ ID NO: 1665, SEQ ID 
NO: 1666, SEQ ID NO: 1685, SEQ ID NO: 1687, SEQ ID NO: 1688, SEQ ID NO: 1675,
SEQ ID NO: 1702, SEQ ID NO: 1713, SEQ ID NO: 1600, SEQ ID NO: 1671, SEQ ID
NO: 1691, SEQ ID NO: 1615, SEQ ID NO: 1616, SEQ ID NO: 1855, SEQ ID NO: 1595,
SEQ ID NO: 1633, SEQ ID NO: 1608, SEQ ID NO: 1611, SEQ ID NO: 1751, SEQ ID
NO: 1772, SEQ ID NO: 1774, SEQ ID NO: 1780, SEQ ID NO: 1783, SEQ ID NO: 1796, 
SEQ ID NO: 1809, SEQ ID NO: 1826, SEQ ID NO: 1868, SEQ ID NO: 1734, SEQ ID
NO: 1786, SEQ ID NO: 1819, SEQ ID NO: 1630, SEQ ID NO: 1706, SEQ ID NO: 1709,
SEQ ID NO: 1495, SEQ ID NO: 1724, SEQ ID NO: 463, SEQ ID NO: 852, SEQ ID NO: 
503, SEQ ID NO: 411, SEQ ID NO: 441, SEQ ID NO: 782, SEQ ID NO: 575, SEQ ID
NO: 691, SEQ ID NO: 724, SEQ ID NO: 452, SEQ ID NO: 386, SEQ ID NO: 497, SEQ
ID NO: 712, SEQ ID NO: 591, SEQ ID NO: 638, SEQ ID NO: 740, SEQ ID NO: 697,
SEQ ID NO: 569, SEQ ID NO: 470, SEQ ID NO: 700, SEQ ID NO: 586, SEQ ID NO:
823, SEQ ID NO: 627, SEQ ID NO: 627, SEQ ID NO: 684, SEQ ID NO: 551, SEQ ID
NO: 478, SEQ ID NO: 508, SEQ ID NO: 545, SEQ ID NO: 628, SEQ ID NO: 443, SEQ
ID NO: 702, SEQ ID NO: 776, SEQ ID NO: 461, SEQ ID NO: 737, SEQ ID NO: 809,
SEQ ID NO: 642, SEQ ID NO: 879, SEQ ID NO: 773, SEQ ID NO: 468, SEQ ID NO:
842, SEQ ID NO: 788, SEQ ID NO: 624, SEQ ID NO: 788, SEQ ID NO: 644, SEQ ID
NO: 727, SEQ ID NO: 631, SEQ ID NO: 450, SEQ ID NO: 448, and SEQ ID NO: 653.

In yet another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori transporter polypeptide or a fragment thereof selected from the group 
consisting of SEQ ID NO: 1670, SEQ ID NO: 1725, SEQ ID NO: 1661, SEQ ID NO: 1873,
SEQ ID NO: 1753, SEQ ID NO: 1759, SEQ ID NO: 1761, SEQ ID NO: 1782, SEQ ID
NO: 1883, SEQ ID NO: 1503, SEQ ID NO: 1542, SEQ ID NO: 1872, SEQ ID NO: 1520,
SEQ ID NO: 1456, SEQ ID NO: 1458, SEQ ID NO: 1617, SEQ ID NO: 1628, SEQ ID
NO: 1644, SEQ ID NO: 1657, SEQ ID NO: 1658, SEQ ID NO: 1755, SEQ ID NO: 1756,
SEQ ID NO: 1797, SEQ ID NO: 1799, SEQ ID NO: 1801, SEQ ID NO: 1483, SEQ ID
NO: 1504, SEQ ID NO: 1532, SEQ ID NO: 1575, SEQ ID NO: 1833, SEQ ID NO: 1888,
SEQ ID NO: 1714, SEQ ID NO: 495, SEQ ID NO: 400, SEQ ID NO: 541, SEQ ID NO: 
673, SEQ ID NO: 482, SEQ ID NO: 622, SEQ ID NO: 689, SEQ ID NO: 736, SEQ ID
NO: 417, SEQ ID NO: 716, SEQ ID NO: 762, SEQ ID NO: 395, SEQ ID NO: 587, SEQ 
ID NO: 669, SEQ ID NO: 758, SEQ ID NO: 593, SEQ ID NO: 451, SEQ ID NO: 827,
SEQ ID NO: 502, SEQ ID NO: 719, SEQ ID NO: 469, SEQ ID NO: 715, SEQ ID NO:
847, SEQ ID NO: 453, SEQ ID NO: 527, SEQ ID NO: 652, SEQ ID NO: 745, SEQ ID 
NO: 567, SEQ ID NO: 848, SEQ ID NO: 430, SEQ ID NO: 748, SEQ ID NO: 396, SEQ 
ID NO: 588, SEQ ID NO: 795, SEQ ID NO: 523, SEQ ID NO: 791, SEQ ID NO: 714,
SEQ ID NO: 481, and SEQ ID NO: 765.

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori outer membrane polypeptide or a fragment thereof selected from the
group consisting of SEQ ID NO: 1624, SEQ ID NO: 1856, SEQ ID NO: 1857, SEQ ID 
NO: 1861, SEQ ID NO: 1537, SEQ ID NO: 1773, SEQ ID NO: 1717, SEQ ID NO: 1733,
SEQ ID NO: 1722, SEQ ID NO: 1659, SEQ ID NO: 1577, SEQ ID NO: 1721, SEQ ID
NO: 1729, SEQ ID NO: 1870, SEQ ID NO: 1576, SEQ ID NO: 1632, SEQ ID NO: 1867,
SEQ ID NO: 1547, SEQ ID NO: 1533, SEQ ID NO: 1597, SEQ ID NO: 1596, SEQ ID 
NO: 1559, SEQ ID NO: 1599, SEQ ID NO: 1788, SEQ ID NO: 1789, SEQ ID NO: 1875, 
SEQ ID NO: 1451, SEQ ID NO: 1478, SEQ ID NO: 1626, SEQ ID NO: 1781, SEQ ID
NO: 837, SEQ ID NO: 833, SEQ ID NO: 585, SEQ ID NO: 865, SEQ ID NO: 764, SEQ
ID NO: 440, SEQ ID NO: 465, SEQ ID NO: 555, SEQ ID NO: 526, SEQ ID NO: 687, 
SEQ ID NO: 692, SEQ ID NO: 693, SEQ ID NO: 677, SEQ ID NO: 649, and SEQ ID NO: 
812. 

Particularly preferred is a purified or isolated H. pylori cytoplasmic polypeptide or a
fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ ID
NO: 1598, SEQ ID NO: 1739, SEQ ID NO: 1775, SEQ ID NO: 1814, SEQ ID NO: 1448, 
SEQ ID NO: 1466, SEQ ID NO: 1535, SEQ ID NO: 1545, SEQ ID NO: 1550, SEQ ID 
NO: 1680, SEQ ID NO: 1701, SEQ ID NO: 1719, SEQ ID NO: 1744, SEQ ID NO: 1790, 
SEQ ID NO: 1859, SEQ ID NO: 1880, SEQ ID NO: 1885, SEQ ID NO: 1679, SEQ ID
NO: 1482, SEQ ID NO: 1485, SEQ ID NO: 1459, SEQ ID NO: 1512, SEQ ID NO: 1515,
SEQ ID NO: 1642, SEQ ID NO: 1668, SEQ ID NO: 1816, SEQ ID NO: 1845, SEQ ID 
NO: 1865, SEQ ID NO: 1866, SEQ ID NO: 1886, SEQ ID NO: 1509, SEQ ID NO: 1510,
SEQ ID NO: 1531, SEQ ID NO: 1579, SEQ ID NO: 1584, SEQ ID NO: 1662, SEQ ID
NO: 1703, SEQ ID NO: 1704, SEQ ID NO: 1737, SEQ ID NO: 1740, SEQ ID NO: 1742,
SEQ ID NO: 1754, SEQ ID NO: 1847, SEQ ID NO: 1447, SEQ ID NO: 1546, SEQ ID
NO: 1607, SEQ ID NO: 1609, SEQ ID NO: 1610, SEQ ID NO: 1728, SEQ ID NO: 1489,
SEQ ID NO: 1708, SEQ ID NO: 1808, SEQ ID NO: 1887, SEQ ID NO: 1498, SEQ ID
NO: 1506, SEQ ID NO: 1592, SEQ ID NO: 1678, SEQ ID NO: 1778, SEQ ID NO: 1863,
SEQ ID NO: 1454, SEQ ID NO: 1538, SEQ ID NO: 1567, SEQ ID NO: 1581, SEQ ID
NO: 1583, SEQ ID NO: 1636, SEQ ID NO: 1639, SEQ ID NO: 1649, SEQ ID NO: 1669,
SEQ ID NO: 1695, SEQ ID NO: 1757, SEQ ID NO: 1776, SEQ ID NO: 1848, SEQ ID
NO: 1849, SEQ ID NO: 1858, SEQ ID NO: 1884, SEQ ID NO: 1667, SEQ ID NO: 1690,
SEQ ID NO: 1813, SEQ ID NO: 1468, SEQ ID NO: 1470, SEQ ID NO: 1811, SEQ ID 
NO: 1874, SEQ ID NO: 1876, SEQ ID NO: 1825, SEQ ID NO: 1479, SEQ ID NO: 1488,
SEQ ID NO: 1528, SEQ ID NO: 1566, SEQ ID NO: 1683, SEQ ID NO: 1692, SEQ ID
NO: 1718, SEQ ID NO: 1614, SEQ ID NO: 1519, SEQ ID NO: 1476, SEQ ID NO: 1493, 
SEQ ID NO: 1497, SEQ ID NO: 1507, SEQ ID NO: 1490, SEQ ID NO: 1523, SEQ ID 
NO: 1524, SEQ ID NO: 1543, SEQ ID NO: 1551, SEQ ID NO: 1553, SEQ ID NO: 1554, 
SEQ ID NO: 1555, SEQ ID NO: 1562, SEQ ID NO: 1570, SEQ ID NO: 1587, SEQ ID
NO: 1588, SEQ ID NO: 1591, SEQ ID NO: 1593, SEQ ID NO: 1684, SEQ ID NO: 1689, 
SEQ ID NO: 1694, SEQ ID NO: 1696, SEQ ID NO: 1698, SEQ ID NO: 1700, SEQ ID
NO: 1712, SEQ ID NO: 1720, SEQ ID NO: 1730, SEQ ID NO: 1735, SEQ ID NO: 1741, 
SEQ ID NO: 1748, SEQ ID NO: 1779, SEQ ID NO: 1821, SEQ ID NO: 1823, SEQ ID
NO: 1828, SEQ ID NO: 1834, SEQ ID NO: 1835, SEQ ID NO: 1836, SEQ ID NO: 1839,
SEQ ID NO: 1852, SEQ ID NO: 1853, SEQ ID NO: 1869, SEQ ID NO: 1871, SEQ ID 
NO: 1878, SEQ ID NO: 1521, SEQ ID NO: 1602, SEQ ID NO: 1627, SEQ ID NO: 1450, 
SEQ ID NO: 1457, SEQ ID NO: 1463, SEQ ID NO: 1469, SEQ ID NO: 1481, SEQ ID 
NO: 1484, SEQ ID NO: 1492, SEQ ID NO: 1500, SEQ ID NO: 1505, SEQ ID NO: 1508,
SEQ ID NO: 1541, SEQ ID NO: 1548, SEQ ID NO: 1580, SEQ ID NO: 1590, SEQ ID
NO: 1594, SEQ ID NO: 1603, SEQ ID NO: 1604, SEQ ID NO: 1606, SEQ ID NO: 1612, 
SEQ ID NO: 1613, SEQ ID NO: 1620, SEQ ID NO: 1621, SEQ ID NO: 1622, SEQ ID 
NO: 1631, SEQ ID NO: 1645, SEQ ID NO: 1646, SEQ ID NO: 1650, SEQ ID NO: 1651, 
SEQ ID NO: 1652, SEQ ID NO: 1653, SEQ ID NO: 1656, SEQ ID NO: 1763, SEQ ID
NO: 1787, SEQ ID NO: 1800, SEQ ID NO: 1806, SEQ ID NO: 1810, SEQ ID NO: 1864,
SEQ ID NO: 1877, SEQ ID NO: 1881, SEQ ID NO: 390, SEQ ID NO: 876, SEQ ID NO: 
547, SEQ ID NO: 678, SEQ ID NO: 729, SEQ ID NO: 786, SEQ ID NO: 654, SEQ ID 
NO: 734, SEQ ID NO: 646, SEQ ID NO: 522, SEQ ID NO: 696, SEQ ID NO: 807, SEQ
ID NO: 683, SEQ ID NO: 790, SEQ ID NO: 763, SEQ ID NO: 806, SEQ ID NO: 799,
SEQ ID NO: 434, SEQ ID NO: 743, SEQ ID NO: 804, SEQ ID NO: 733, SEQ ID NO:
826, SEQ ID NO: 562, SEQ ID NO: 420, SEQ ID NO: 664, SEQ ID NO: 850, SEQ ID
NO: 857, SEQ ID NO: 861, SEQ ID NO: 872, SEQ ID NO: 544, SEQ ID NO: 830, SEQ
ID NO: 446, SEQ ID NO: 397, SEQ ID NO: 699, SEQ ID NO: 459, SEQ ID NO: 509,
SEQ ID NO: 818, SEQ ID NO: 488, SEQ ID NO: 438, SEQ ID NO: 831, SEQ ID NO:
667, SEQ ID NO: 429, SEQ ID NO: 680, SEQ ID NO: 597, SEQ ID NO: 460, SEQ ID
NO: 709, SEQ ID NO: 822, SEQ ID NO: 466, SEQ ID NO: 584, SEQ ID NO: 388, SEQ
ID NO: 631, SEQ ID NO: 787, SEQ ID NO: 532, SEQ ID NO: 619, SEQ ID NO: 723,
SEQ ID NO: 641, SEQ ID NO: 698, SEQ ID NO: 630, SEQ ID NO: 869, SEQ ID NO:
601, SEQ ID NO: 415, SEQ ID NO: 542, SEQ ID NO: 704, SEQ ID NO: 572, SEQ ID
NO: 467, SEQ ID NO: 399, SEQ ID NO: 579, SEQ ID NO: 739, SEQ ID NO: 849, SEQ
ID NO: 824, SEQ ID NO: 871, SEQ ID NO: 547, SEQ ID NO: 633, SEQ ID NO: 695,
SEQ ID NO: 405, SEQ ID NO: 394, SEQ ID NO: 761, SEQ ID NO: 574, SEQ ID NO:
596, SEQ ID NO: 832, SEQ ID NO: 651, SEQ ID NO: 867, SEQ ID NO: 614, SEQ ID 
NO: 401, SEQ ID NO: 393, SEQ ID NO: 413, SEQ ID NO: 835, SEQ ID NO: 863, SEQ
ID NO: 458, SEQ ID NO: 701, SEQ ID NO: 531, SEQ ID NO: 550, SEQ ID NO: 439,
SEQ ID NO: 516, SEQ ID NO: 802, SEQ ID NO: 581, SEQ ID NO: 535, SEQ ID NO: 
578, SEQ ID NO: 492, SEQ ID NO: 858, SEQ ID NO: 720, SEQ ID NO: 813, SEQ ID 
NO: 426, SEQ ID NO: 834, SEQ ID NO: 609, SEQ ID NO: 489, SEQ ID NO: 480, SEQ
ID NO: 406, SEQ ID NO: 392, SEQ ID NO: 456, SEQ ID NO: 707, SEQ ID NO: 533,
SEQ ID NO: 728, SEQ ID NO: 769, SEQ ID NO: 671, SEQ ID NO: 602, SEQ ID NO: 
618, SEQ ID NO: 618, SEQ ID NO: 682, SEQ ID NO: 524, SEQ ID NO: 802, SEQ ID
NO: 785, SEQ ID NO: 457, SEQ ID NO: 781, SEQ ID NO: 473, SEQ ID NO: 384, SEQ
ID NO: 726, SEQ ID NO: 817, SEQ ID NO: 498, SEQ ID NO: 436, SEQ ID NO: 815,
SEQ ID NO: 856, SEQ ID NO: 650, SEQ ID NO: 844, SEQ ID NO: 580, SEQ ID NO:
783, SEQ ID NO: 416, SEQ ID NO: 741, SEQ ID NO: 442, SEQ ID NO: 803, SEQ ID 
NO: 520, SEQ ID NO: 566, SEQ ID NO: 557, SEQ ID NO: 706, SEQ ID NO: 710, SEQ 
ID NO: 487, SEQ ID NO: 603, SEQ ID NO: 472, SEQ ID NO: 476, SEQ ID NO: 770, 
SEQ ID NO: 841, SEQ ID NO: 768, SEQ ID NO: 839, SEQ ID NO: 560, SEQ ID NO:
796, SEQ ID NO: 483, SEQ ID NO: 634, SEQ ID NO: 445, SEQ ID NO: 853, SEQ ID
NO: 525, SEQ ID NO: 798, SEQ ID NO: 549, SEQ ID NO: 836, SEQ ID NO: 589, SEQ 
ID NO: 760, SEQ ID NO: 462, SEQ ID NO: 789, SEQ ID NO: 507, SEQ ID NO: 828, 
SEQ ID NO: 866, SEQ ID NO: 754, SEQ ID NO: 730, SEQ ID NO: 617, SEQ ID NO: 
455, SEQ ID NO: 873, SEQ ID NO: 435, SEQ ID NO: 766, SEQ ID NO: 793, SEQ ID
NO: 742, SEQ ID NO: 599, SEQ ID NO: 854, and SEQ ID NO: 632.

In one embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is 
an H. pylori polypeptide or a fragment thereof involved in energy conversion selected from
the group consisting of SEQ ID NO: 1598, SEQ ID NO: 1739, SEQ ID NO: 1775, SEQ ID 
NO: 1814, SEQ ID NO: 390, SEQ ID NO: 876, SEQ ID NO: 547, and SEQ ID NO: 678. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof
is an H. pylori polypeptide or a fragment thereof involved in amino acid metabolism
selected from the group consisting of SEQ ID NO: 1448, SEQ ID NO: 1466, SEQ ID NO: 
1535, SEQ ID NO: 1545, SEQ ID NO: 1550, SEQ ID NO: 1680, SEQ ID NO: 1701, SEQ 
ID NO: 1719, SEQ ID NO: 1744, SEQ ID NO: 1790, SEQ ID NO: 1859, SEQ ID NO:
1880, SEQ ID NO: 1885, SEQ ID NO: 1679, SEQ ID NO: 1482, SEQ ID NO: 1485, SEQ
ID NO: 1459, SEQ ID NO: 729, SEQ ID NO: 786, SEQ ID NO: 654, SEQ ID NO: 734, 
SEQ ID NO: 646, SEQ ID NO: 522, SEQ ID NO: 696, SEQ ID NO: 807, SEQ ID NO:
683, SEQ ID NO: 790, SEQ ID NO: 763, SEQ ID NO: 806, SEQ ID NO: 799, SEQ ID 
NO: 434, SEQ ID NO: 743, SEQ ID NO: 804, and SEQ ID NO: 733. 

In yet another embodiment, the H. pylori cytoplasmic polypeptide or a fragment
thereof is an H. pylori polypeptide or a fragment thereof involved in nucleotide metabolism
selected from the group consisting of SEQ ID NO: 1512, SEQ ID NO: 1515, SEQ ID NO:
1642, SEQ ID NO: 1668, SEQ ID NO: 1816, SEQ ID NO: 1845, SEQ ID NO: 1865, SEQ
ID NO: 1866, SEQ ID NO: 1886, SEQ ID NO: 1509, SEQ ID NO: 1510, SEQ ID NO: 826,
SEQ ID NO: 562, SEQ ID NO: 420, SEQ ID NO: 664, SEQ ID NO: 850, SEQ ID NO:
857, SEQ ID NO: 861, SEQ ID NO: 872, SEQ ID NO: 544, SEQ ID NO: 830, and SEQ ID 
NO: 446. 

In yet a further embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof involved in carbohydrate
metabolism selected from the group consisting of SEQ ID NO: 1531, SEQ ID NO: 1579,
SEQ ID NO: 1584, SEQ ID NO: 1662, SEQ ID NO: 1703, SEQ ID NO: 1704, SEQ ID 
NO: 1737, SEQ ID NO: 1740, SEQ ID NO: 1742, SEQ ID NO: 1754, SEQ ID NO: 1847, 
SEQ ID NO: 1447, SEQ ID NO: 397, SEQ ID NO: 699, SEQ ID NO: 459, SEQ ID NO: 
509, SEQ ID NO: 818, SEQ ID NO: 488, SEQ ID NO: 438, SEQ ID NO: 831, SEQ ID
NO: 667, SEQ ID NO: 429, SEQ ID NO: 680, and SEQ ID NO: 597. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in cofactor metabolism selected 
from the group consisting of SEQ ID NO: 1546, SEQ ID NO: 1607, SEQ ID NO: 1609, 
SEQ ID NO: 1610, SEQ ID NO: 1728, SEQ ID NO: 1489, SEQ ID NO: 460, SEQ ID NO:
709, SEQ ID NO: 822, SEQ ID NO: 466, SEQ ID NO: 584, and SEQ ID NO: 388.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in lipid metabolism selected 
from the group consisting of SEQ ID NO: 1708, SEQ ID NO: 1808, SEQ ID NO: 1887,
SEQ ID NO: 631, SEQ ID NO: 787, and SEQ ID NO: 532.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in mRNA translation and 
ribosome biogenesis selected from the group consisting of SEQ ID NO: 1498, SEQ ID NO:
1506, SEQ ID NO: 1592, SEQ ID NO: 1678, SEQ ID NO: 1778, SEQ ID NO: 1863, SEQ
ID NO: 619, SEQ ID NO: 723, SEQ ID NO: 641, SEQ ID NO: 698, SEQ ID NO: 630, and
SEQ ID NO: 869.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in genome replication, 
transcription, recombination and repair selected from the group consisting of SEQ ID NO:
1454, SEQ ID NO: 1538, SEQ ID NO: 1567, SEQ ID NO: 1581, SEQ ID NO: 1583, SEQ
ID NO: 1636, SEQ ID NO: 1639, SEQ ID NO: 1649, SEQ ID NO: 1669, SEQ ID NO:
1695, SEQ ID NO: 1757, SEQ ID NO: 1776, SEQ ID NO: 1848, SEQ ID NO: 1849, SEQ
ID NO: 1858, SEQ ID NO: 1884, SEQ ID NO: 601, SEQ ID NO: 415, SEQ ID NO: 542,
SEQ ID NO: 704, SEQ ID NO: 572, SEQ ID NO: 467, SEQ ID NO: 399, SEQ ID NO:
579, SEQ ID NO: 739, SEQ ID NO: 849, SEQ ID NO: 824, SEQ ID NO: 871, SEQ ID
NO: 547, SEQ ID NO: 633, SEQ ID NO: 695, SEQ ID NO: 405, SEQ ID NO: 394, and
SEQ ID NO: 761.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof 
is an H. pylori polypeptide or a fragment thereof involved in outer membrane or cell wall
biosynthesis selected from the group consisting of SEQ ID NO: 1667, SEQ ID NO: 1690,
SEQ ID NO: 1813, SEQ ID NO: 1468, SEQ ID NO: 1470, SEQ ID NO: 1811, SEQ ID
NO: 574, SEQ ID NO: 596, SEQ ID NO: 832, SEQ ID NO: 651, SEQ ID NO: 867, SEQ
ID NO: 614, SEQ ID NO: 401, and SEQ ID NO: 393.

In yet another embodiment, the H. pylori cytoplasmic polypeptide or a fragment
thereof is an H. pylori chaperone polypeptide or a fragment thereof selected from the group
consisting of SEQ ID NO: 1874, SEQ ID NO: 1876, SEQ ID NO: 1825, SEQ ID NO: 413, 
SEQ ID NO: 835, and SEQ ID NO: 863. 

Particularly preferred is a purified or isolated H. pylori secreted or periplasmic
polypeptide or a fragment thereof, wherein the polypeptide is selected from the group
consisting of SEQ ID NO: 1455, SEQ ID NO: 1589, SEQ ID NO: 1518, SEQ ID NO: 1529,
SEQ ID NO: 1765, SEQ ID NO: 1770, SEQ ID NO: 1829, SEQ ID NO: 1556, SEQ ID
NO: 1565, SEQ ID NO: 1569, SEQ ID NO: 1571, SEQ ID NO: 1574, SEQ ID NO: 1578, 
SEQ ID NO: 1663, SEQ ID NO: 1674, SEQ ID NO: 1676, SEQ ID NO: 1697, SEQ ID 
NO: 1699, SEQ ID NO: 1710, SEQ ID NO: 1715, SEQ ID NO: 1716, SEQ ID NO: 1732,
SEQ ID NO: 1736, SEQ ID NO: 1745, SEQ ID NO: 1749, SEQ ID NO: 1750, SEQ ID 
NO: 1766, SEQ ID NO: 1767, SEQ ID NO: 1768, SEQ ID NO: 1769, SEQ ID NO: 1795, 
SEQ ID NO: 1802, SEQ ID NO: 1804, SEQ ID NO: 1824, SEQ ID NO: 1831, SEQ ID
NO: 1838, SEQ ID NO: 1840, SEQ ID NO: 1844, SEQ ID NO: 1862, SEQ ID NO: 1879,
SEQ ID NO: 1882, SEQ ID NO: 1890, SEQ ID NO: 1494, SEQ ID NO: 1634, SEQ ID
NO: 1635, SEQ ID NO: 1647, SEQ ID NO: 1648, SEQ ID NO: 1654, SEQ ID NO: 1446,
SEQ ID NO: 1449, SEQ ID NO: 1452, SEQ ID NO: 1473, SEQ ID NO: 1474, SEQ ID 
NO: 1480, SEQ ID NO: 1491, SEQ ID NO: 1502, SEQ ID NO: 1513, SEQ ID NO: 1605, 
SEQ ID NO: 1771, SEQ ID NO: 1526, SEQ ID NO: 1557, SEQ ID NO: 1560, SEQ ID
NO: 1585, SEQ ID NO: 1672, SEQ ID NO: 1677, SEQ ID NO: 1686, SEQ ID NO: 1752,
SEQ ID NO: 1762, SEQ ID NO: 1777, SEQ ID NO: 1792, SEQ ID NO: 1805, SEQ ID
NO: 1815, SEQ ID NO: 1817, SEQ ID NO: 1827, SEQ ID NO: 1842, SEQ ID NO: 1846,
SEQ ID NO: 1896, SEQ ID NO: 1530, SEQ ID NO: 1637, SEQ ID NO: 1461, SEQ ID
NO: 1467, SEQ ID NO: 1623, SEQ ID NO: 1625, SEQ ID NO: 530, SEQ ID NO: 708, 
SEQ ID NO: 414, SEQ ID NO: 694, SEQ ID NO: 703, SEQ ID NO: 721, SEQ ID NO:
749, SEQ ID NO: 685, SEQ ID NO: 444, SEQ ID NO: 606, SEQ ID NO: 582, SEQ ID 
NO: 621, SEQ ID NO: 868, SEQ ID NO: 666, SEQ ID NO: 408, SEQ ID NO: 538, SEQ 
ID NO: 573, SEQ ID NO: 639, SEQ ID NO: 668, SEQ ID NO: 524, SEQ ID NO: 422, 
SEQ ID NO: 819, SEQ ID NO: 611, SEQ ID NO: 674, SEQ ID NO: 577, SEQ ID NO: 
663, SEQ ID NO: 558, SEQ ID NO: 794, SEQ ID NO: 564, SEQ ID NO: 592, SEQ ID
NO: 814, SEQ ID NO: 398, SEQ ID NO: 767, SEQ ID NO: 425, SEQ ID NO: 659, SEQ 
ID NO: 517, SEQ ID NO: 539, SEQ ID NO: 475, SEQ ID NO: 615, SEQ ID NO: 665,
SEQ ID NO: 607, SEQ ID NO: 598, SEQ ID NO: 759, SEQ ID NO: 752, SEQ ID NO: 
595, SEQ ID NO: 686, SEQ ID NO: 528, SEQ ID NO: 705, SEQ ID NO: 828, SEQ ID
NO: 403, SEQ ID NO: 561, SEQ ID NO: 500, SEQ ID NO: 491, SEQ ID NO: 846, SEQ
ID NO: 732, SEQ ID NO: 778, SEQ ID NO: 751, SEQ ID NO: 744, SEQ ID NO: 504,
SEQ ID NO: 419, SEQ ID NO: 792, SEQ ID NO: 825, SEQ ID NO: 756, SEQ ID NO: 
519, SEQ ID NO: 870, SEQ ID NO: 777, SEQ ID NO: 808, SEQ ID NO: 506, SEQ ID
NO: 864, SEQ ID NO: 655, SEQ ID NO: 407, SEQ ID NO: 427, SEQ ID NO: 774, SEQ
ID NO: 797, SEQ ID NO: 688, SEQ ID NO: 815, SEQ ID NO: 718, SEQ ID NO: 859,
SEQ ID NO: 775, SEQ ID NO: 874, SEQ ID NO: 543, SEQ ID NO: 878, SEQ ID NO: 
594, SEQ ID NO: 610, and SEQ ID NO: 600. 

Particularly preferred is a purified or isolated H. pylori surface or membrane
polypeptide or a fragment thereof, wherein the polypeptide is selected from the group
consisting of SEQ ID NO: 1511, SEQ ID NO: 1561, SEQ ID NO: 1563, SEQ ID NO: 1681,
SEQ ID NO: 1711, SEQ ID NO: 1731, SEQ ID NO: 1743, SEQ ID NO: 1747, SEQ ID
NO: 1758, SEQ ID NO: 1893, SEQ ID NO: 1895, SEQ ID NO: 1573, SEQ ID NO: 1705,
SEQ ID NO: 1707, SEQ ID NO: 1723, SEQ ID NO: 1726, SEQ ID NO: 1760, SEQ ID
NO: 1764, SEQ ID NO: 1798, SEQ ID NO: 1803, SEQ ID NO: 1807, SEQ ID NO: 1889,
SEQ ID NO: 1892, SEQ ID NO: 1460, SEQ ID NO: 1477, SEQ ID NO: 1499, SEQ ID
NO: 1514, SEQ ID NO: 1641, SEQ ID NO: 1534, SEQ ID NO: 1564, SEQ ID NO: 1673,
SEQ ID NO: 1746, SEQ ID NO: 1794, SEQ ID NO: 1843, SEQ ID NO: 1894, SEQ ID
NO: 1536, SEQ ID NO: 1544, SEQ ID NO: 1568, SEQ ID NO: 1572, SEQ ID NO: 1582,
SEQ ID NO: 1738, SEQ ID NO: 1891, SEQ ID NO: 1660, SEQ ID NO: 1793, SEQ ID
NO: 1832, SEQ ID NO: 1841, SEQ ID NO: 1860, SEQ ID NO: 1486, SEQ ID NO: 1465,
SEQ ID NO: 1539, SEQ ID NO: 1693, SEQ ID NO: 1629, SEQ ID NO: 1540, SEQ ID
NO: 1791, SEQ ID NO: 1525, SEQ ID NO: 1558, SEQ ID NO: 1655, SEQ ID NO: 1517, 
SEQ ID NO: 875, SEQ ID NO: 676, SEQ ID NO: 801, SEQ ID NO: 860, SEQ ID NO:
747, SEQ ID NO: 529, SEQ ID NO: 387, SEQ ID NO: 391, SEQ ID NO: 515, SEQ ID
NO: 625, SEQ ID NO: 568, SEQ ID NO: 454, SEQ ID NO: 800, SEQ ID NO: 499, SEQ
ID NO: 779, SEQ ID NO: 648, SEQ ID NO: 643, SEQ ID NO: 537, SEQ ID NO: 511,
SEQ ID NO: 616, SEQ ID NO: 829, SEQ ID NO: 546, SEQ ID NO: 780, SEQ ID NO: 
553, SEQ ID NO: 549, SEQ ID NO: 410, SEQ ID NO: 608, SEQ ID NO: 513, SEQ ID
NO: 656, SEQ ID NO: 845, SEQ ID NO: 563, SEQ ID NO: 570, SEQ ID NO: 805, SEQ
ID NO: 851, SEQ ID NO: 423, SEQ ID NO: 810, SEQ ID NO: 604, SEQ ID NO: 636,
SEQ ID NO: 757, SEQ ID NO: 412, SEQ ID NO: 816, SEQ ID NO: 771, SEQ ID NO: 
418, SEQ ID NO: 662, SEQ ID NO: 512, SEQ ID NO: 679, SEQ ID NO: 605, SEQ ID 
NO: 421, SEQ ID NO: 552, SEQ ID NO: 576, SEQ ID NO: 571, SEQ ID NO: 725, SEQ
ID NO: 565, SEQ ID NO: 717, SEQ ID NO: 536, SEQ ID NO: 647, SEQ ID NO: 501,
SEQ ID NO: 838, SEQ ID NO: 428, SEQ ID NO: 494, SEQ ID NO: 479, SEQ ID NO:
711, SEQ ID NO: 722, SEQ ID NO: 645, SEQ ID NO: 670, SEQ ID NO: 746, SEQ ID
NO: 493, SEQ ID NO: 540, SEQ ID NO: 612, SEQ ID NO: 629, SEQ ID NO: 484, SEQ 
ID NO: 485, SEQ ID NO: 486, SEQ ID NO: 433, SEQ ID NO: 385, and SEQ ID NO: 409.

In one embodiment, the H. pylori surface or membrane polypeptide or a fragment
thereof is an H. pylori polypeptide or a fragment thereof having at least one membrane
spanning region selected from the group consisting of SEQ ID NO: 1511, SEQ ID NO:
1561, SEQ ID NO: 1563, SEQ ID NO: 1681, SEQ ID NO: 1711, SEQ ID NO: 1731, SEQ
ID NO: 1743, SEQ ID NO: 1747, SEQ ID NO: 1758, SEQ ID NO: 1893, SEQ ID NO:
1895, SEQ ID NO: 875, SEQ ID NO: 676, SEQ ID NO: 801, SEQ ID NO: 860, SEQ ID
NO: 747, SEQ ID NO: 529, SEQ ID NO: 387, SEQ ID NO: 391, SEQ ID NO: 515, SEQ
ID NO: 625, SEQ ID NO: 568, SEQ ID NO: 454, SEQ ID NO: 800, SEQ ID NO: 499,
SEQ ID NO: 779, and SEQ ID NO: 385.

In another embodiment, the H. pylori surface or membrane polypeptide or a
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least two
membrane spanning regions selected from the group consisting of SEQ ID NO: 1573, SEQ
ID NO: 1705, SEQ ID NO: 1707, SEQ ID NO: 1723, SEQ ID NO: 1726, SEQ ID NO:
1760, SEQ ID NO: 1764, SEQ ID NO: 1798, SEQ ID NO: 1803, SEQ ID NO: 1807, SEQ
ID NO: 1889, SEQ ID NO: 1892, SEQ ID NO: 1460, SEQ ID NO: 1477, SEQ ID NO:
1499, SEQ ID NO: 1514, SEQ ID NO: 1641, SEQ ID NO: 648, SEQ ID NO: 643, SEQ ID
NO: 537, SEQ ID NO: 511, SEQ ID NO: 616, SEQ ID NO: 829, SEQ ID NO: 546, SEQ
ID NO: 780, SEQ ID NO: 553, SEQ ID NO: 549, SEQ ID NO: 410, SEQ ID NO: 608,
SEQ ID NO: 513, SEQ ID NO: 656, SEQ ID NO: 845, SEQ ID NO: 563, SEQ ID NO:
570, SEQ ID NO: 805, and SEQ ID NO: 851.

In yet another embodiment, the H. pylori surface or membrane polypeptide or a 
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least three 
membrane spanning regions selected from the group consisting of SEQ ID NO: 1534, SEQ 
ID NO: 1564, SEQ ID NO: 1673, SEQ ID NO: 1746, SEQ ID NO: 1794, SEQ ID NO:
1843, SEQ ID NO: 1894, SEQ ID NO: 423, SEQ ID NO: 810, SEQ ID NO: 604, SEQ ID
NO: 636, SEQ ID NO: 757, SEQ ID NO: 412, and SEQ ID NO: 816. 

In yet a further embodiment, the H. pylori surface or membrane polypeptide or a
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least four
membrane spanning regions selected from the group consisting of SEQ ID NO: 1536, SEQ
ID NO: 1544, SEQ ID NO: 1568, SEQ ID NO: 1572, SEQ ID NO: 1582, SEQ ID NO:
1738, SEQ ID NO: 1891, SEQ ID NO: 1660, SEQ ID NO: 771, SEQ ID NO: 418, SEQ ID
NO: 662, SEQ ID NO: 512, SEQ ID NO: 679, SEQ ID NO: 605, SEQ ID NO: 421, SEQ
ID NO: 552, SEQ ID NO: 576, and SEQ ID NO: 571.

In another embodiment, the H. pylori surface or membrane polypeptide or a
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least five
membrane spanning regions selected from the group consisting of SEQ ID NO: 1793, SEQ
ID NO: 1832, SEQ ID NO: 1841, SEQ ID NO: 1860, SEQ ID NO: 1486, SEQ ID NO: 725,
SEQ ID NO: 565, SEQ ID NO: 717, SEQ ID NO: 536, SEQ ID NO: 647, and SEQ ID NO:
409.

In another embodiment, the H. pylori surface or membrane polypeptide or a
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least six
membrane spanning regions selected from the group consisting of SEQ ID NO: 1465, SEQ
ID NO: 1539, SEQ ID NO: 1693, SEQ ID NO: 1629, SEQ ID NO: 501, SEQ ID NO: 838,
SEQ ID NO: 428, SEQ ID NO: 494, SEQ ID NO: 479, SEQ ID NO: 711, and SEQ ID NO:
722.

In yet another embodiment, the H. pylori surface or membrane polypeptide or a 
fragment thereof is an H. pylori polypeptide or a fragment thereof having at least seven 
membrane spanning regions selected from the group consisting of SEQ ID NO: 1540, SEQ
ID NO: 1791, SEQ ID NO: 1525, SEQ ID NO: 1558, SEQ ID NO: 1655, SEQ ID NO:
1517, SEQ ID NO: 645, SEQ ID NO: 670, SEQ ID NO: 746, SEQ ID NO: 493, SEQ ID 
NO: 540, SEQ ID NO: 612, SEQ ID NO: 629, SEQ ID NO: 484, SEQ ID NO: 485, SEQ 
ID NO: 486, and SEQ ID NO: 433. 

In another aspect, the invention pertains to any individual H. pylori polypeptide
member or nucleic acid encoding such a member from the above-identified groups of H.
pylori polypeptides.


In another aspect, the invention features nucleic acids capable of binding mRNA of 
H. pylori. Such nucleic acid is capable of acting as antisense nucleic acid to control the 
translation of mRNA of H. pylori. A further aspect features a nucleic acid which is capable
of binding specifically to an H. pylori nucleic acid. These nucleic acids are also referred to
herein as complements and have utility as probes and as capture reagents. 
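
To make the notion of a complement concrete, the following is a minimal Python sketch and not part of the disclosed methods. It assumes sequences are supplied as plain strings over A, C, G, T (DNA) or A, C, G, U (RNA); the function names `reverse_complement` and `antisense_dna` are hypothetical and chosen only for illustration. It shows how a DNA oligonucleotide complementary to a given mRNA stretch could be derived by base pairing; it says nothing about whether such an oligonucleotide would bind specifically in practice.

```python
# Illustrative sketch only; names are hypothetical and not taken from the disclosure.

# Watson-Crick pairing for DNA bases.
_DNA_COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string, read 5' to 3'."""
    dna = dna.upper()
    try:
        return "".join(_DNA_COMPLEMENT[base] for base in reversed(dna))
    except KeyError as err:
        # Reject anything outside the assumed A/C/G/T alphabet.
        raise ValueError(f"unexpected base: {err.args[0]!r}") from None


def antisense_dna(mrna: str) -> str:
    """Return a DNA oligo (5' to 3') complementary to an mRNA string."""
    # Treat the mRNA as its DNA equivalent (U -> T), then reverse-complement.
    return reverse_complement(mrna.upper().replace("U", "T"))
```

For example, `antisense_dna("AUGGCU")` pairs base by base with the mRNA fragment 5'-AUGGCU-3' when the two strands are aligned antiparallel.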

In another aspect, the invention features an expression system comprising an open 
reading frame corresponding to H. pylori nucleic acid. The nucleic acid further comprises a
control sequence compatible with an intended host. The expression system is useful for
making polypeptides corresponding to H. pylori nucleic acid.

In another aspect, the invention features a cell transformed with the expression 
system to produce H. pylori polypeptides. 

In another aspect, the invention features a method of generating antibodies against 
H. pylori polypeptides which are capable of binding specifically to H. pylori polypeptides.
Such antibodies have utility as reagents for immunoassays to evaluate the abundance and
distribution of H. pylori-specific antigens.

In another aspect, the invention features a method of generating vaccines for 
immunizing an individual against H. pylori. The method includes: immunizing a subject 
with an H. pylori polypeptide, e.g., a surface or secreted polypeptide, or active portion 
thereof, and a pharmaceutically acceptable carrier. Such vaccines have therapeutic and
prophylactic utilities. 

In another aspect, the invention provides a method for generating a vaccine 
comprising a modified immunogenic H. pylori polypeptide, e.g., a surface or secreted 
polypeptide, or active portion thereof, and a pharmacologically acceptable carrier.

In another aspect, the invention features a method of evaluating a compound, e.g., a
polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. pylori
polypeptide. The method includes: contacting the candidate compound with an H. pylori
polypeptide and determining if the compound binds or otherwise interacts with an H. pylori
polypeptide. Compounds which bind H. pylori are candidates as activators or inhibitors of
the bacterial life cycle. These assays can be performed in vitro or in vivo. 

In another aspect, the invention features a method of evaluating a compound, e.g., a
polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. pylori
nucleic acid, e.g., DNA or RNA. The method includes: contacting the candidate
compound with an H. pylori nucleic acid and determining if the compound binds or
otherwise interacts with an H. pylori polypeptide. Compounds which bind H. pylori are
candidates as activators or inhibitors of the bacterial life cycle. These assays can be 
performed in vitro or in vivo. 

The invention features H. pylori polypeptides, preferably a substantially pure
preparation of an H. pylori polypeptide, or a recombinant H. pylori polypeptide. In
preferred embodiments: the polypeptide has biological activity; the polypeptide has an
amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to an 
amino acid sequence of the invention contained in the Sequence Listing, preferably it has 
about 65% sequence identity with an amino acid sequence of the invention contained in the
Sequence Listing, and most preferably it has about 92% to about 99% sequence identity
with an amino acid sequence of the invention contained in the Sequence Listing; the 
polypeptide has an amino acid sequence essentially the same as an amino acid sequence of 
the invention contained in the Sequence Listing; the polypeptide is at least 5, 10, 20, 50, 
100, or 150 amino acid residues in length; the polypeptide includes at least 5, preferably at
least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous
amino acid residues of the invention contained in the Sequence Listing. In yet another 
preferred embodiment, the amino acid sequence which differs in sequence identity by about 
7% to about 8% from the H. pylori amino acid sequences of the invention contained in the 
Sequence Listing is also encompassed by the invention. 
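
The percent identity thresholds above can be illustrated with a minimal Python sketch. It is offered under stated assumptions and is not the method used in this disclosure: the text does not say how sequences are aligned or which denominator is used, so the sketch assumes the two sequences are already aligned to equal length, that "-" marks a gap, that gap positions count as mismatches, and that identity is reported over the full alignment length. The function name `percent_identity` is hypothetical.

```python
# Illustrative sketch only; the alignment convention below is an assumption,
# not something specified in the disclosure.

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Gap columns are counted as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)
```

Under these assumptions, two aligned 8-residue sequences that differ at one position are 87.5% identical, which would satisfy the 60%, 70%, and 80% thresholds but not the 90% threshold. A different gap or denominator convention would give different values.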

In preferred embodiments: the H. pylori polypeptide is encoded by a nucleic acid of
the invention contained in the Sequence Listing, or by a nucleic acid having at least 60%,
70%, 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid of the invention 
contained in the Sequence Listing. 

In a preferred embodiment, the subject H. pylori polypeptide differs in amino acid
sequence at 1, 2, 3, 5, 10 or more residues from a sequence of the invention contained in the
Sequence Listing. The differences, however, are such that the H. pylori polypeptide
exhibits an H. pylori biological activity, e.g., the H. pylori polypeptide retains a biological
activity of a naturally occurring H. pylori enzyme.




In preferred embodiments, the polypeptide includes all or a fragment of an amino
acid sequence of the invention contained in the Sequence Listing, fused, in reading frame,
to additional amino acid residues, preferably to residues encoded by genomic DNA 5' or 3'
to the genomic DNA which encodes a sequence of the invention contained in the Sequence
Listing.

In yet other preferred embodiments, the H. pylori polypeptide is a recombinant
fusion protein having a first H. pylori polypeptide portion and a second polypeptide
portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to H.
pylori. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a
DNA binding domain, or a polymerase activating domain. In a preferred embodiment, the
fusion protein can be used in a two-hybrid assay.

Polypeptides of the invention include those which arise as a result of alternative
transcription events, alternative RNA splicing events, and alternative translational and
post-translational events.

The invention also encompasses an immunogenic component which includes an H.
pylori polypeptide in an immunogenic preparation; the immunogenic component being
capable of eliciting an immune response specific for the H. pylori polypeptide, e.g., a
humoral response, an antibody response, or a cellular response. In preferred embodiments,
the immunogenic component comprises at least one antigenic determinant from a
polypeptide of the invention contained in the Sequence Listing.

In another aspect, the invention provides a substantially pure nucleic acid having a
nucleotide sequence which encodes an H. pylori polypeptide. In preferred embodiments:
the encoded polypeptide has biological activity; the encoded polypeptide has an amino acid
sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous to an amino acid
sequence of the invention contained in the Sequence Listing; the encoded polypeptide has
an amino acid sequence essentially the same as an amino acid sequence of the invention
contained in the Sequence Listing; the encoded polypeptide is at least 5, 10, 20, 50, 100, or
150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least
10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino
acids of the invention contained in the Sequence Listing.

In preferred embodiments: the nucleic acid of the invention is that contained in the 
Sequence Listing; the nucleic acid is at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% 
homologous with a nucleic acid sequence of the invention contained in the Sequence 
Listing. 

In a preferred embodiment, the encoded H. pylori polypeptide differs (e.g., by
amino acid substitution, addition or deletion of at least one amino acid residue) in amino
acid sequence at 1, 2, 3, 5, 10 or more residues, from a sequence of the invention contained
in the Sequence Listing. The differences, however, are such that the encoded H. pylori
polypeptide exhibits an H. pylori biological activity, e.g., the encoded H. pylori enzyme
retains a biological activity of a naturally occurring H. pylori enzyme.

In preferred embodiments, the encoded polypeptide includes all or a fragment of an
amino acid sequence of the invention contained in the Sequence Listing, fused, in reading
frame, to additional amino acid residues, preferably to residues encoded by genomic DNA
5' or 3' to the genomic DNA which encodes a sequence of the invention contained in the
Sequence Listing.

In preferred embodiments, the subject H. pylori nucleic acid will include a
transcriptional regulatory sequence, e.g., at least one of a transcriptional promoter or
transcriptional enhancer sequence, operably linked to the H. pylori gene sequence, e.g., to
render the H. pylori gene sequence suitable for expression in a recombinant host cell.

In yet a further preferred embodiment, the nucleic acid which encodes an H. pylori
polypeptide of the invention hybridizes under stringent conditions to a nucleic acid probe
corresponding to at least 8 consecutive nucleotides of the invention contained in the
Sequence Listing; more preferably to at least 12 consecutive nucleotides of the invention
contained in the Sequence Listing; more preferably to at least 20 consecutive nucleotides of
the invention contained in the Sequence Listing; more preferably to at least 40 consecutive
nucleotides of the invention contained in the Sequence Listing.
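
Hybridization under stringent conditions is an experimental property, but the length thresholds named above (8, 12, 20, or 40 consecutive nucleotides) can be screened computationally as a rough first pass. The following Python sketch is illustrative only: the function names and toy sequences are hypothetical, exact single-strand matching is assumed as a crude stand-in for hybridization, and nothing in it is taken from the Sequence Listing.

```python
# Thresholds named in the text: 8, 12, 20, or 40 consecutive nucleotides.
PREFERRED_MIN_LENGTHS = (8, 12, 20, 40)


def longest_shared_run(candidate, reference):
    """Length of the longest stretch of consecutive nucleotides found in both.

    Classic longest-common-substring dynamic programme; exact matching is an
    assumption standing in for hybridization, which tolerates mismatches.
    """
    best = 0
    prev = [0] * (len(reference) + 1)
    for a in candidate:
        curr = [0] * (len(reference) + 1)
        for j, b in enumerate(reference, start=1):
            if a == b:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def thresholds_met(candidate, reference):
    """Return the preferred probe lengths that the shared run satisfies."""
    run = longest_shared_run(candidate, reference)
    return [n for n in PREFERRED_MIN_LENGTHS if run >= n]


if __name__ == "__main__":
    # Hypothetical toy sequences sharing a 12-nucleotide stretch.
    print(thresholds_met("GGGATTGCCATTGCCTTT", "CCCATTGCCATTGCCAAA"))
```

A real screen would also consider the antisense strand and would be confirmed experimentally under the stated stringency conditions.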

In a preferred embodiment, the nucleic acid encodes a peptide which differs by at
least one amino acid residue from the sequences of the invention contained in the Sequence
Listing.

In a preferred embodiment, the nucleic acid differs by at least one nucleotide from a 
nucleotide sequence of the invention contained in the Sequence Listing which encodes 
amino acids of the invention contained in the Sequence Listing. 

In another aspect, the invention encompasses: a vector including a nucleic acid
which encodes an H. pylori polypeptide or an H. pylori polypeptide variant as described
herein; a host cell transfected with the vector; and a method of producing a recombinant H.
pylori polypeptide or H. pylori polypeptide variant, including culturing the cell, e.g., in a
cell culture medium, and isolating the H. pylori polypeptide or H. pylori polypeptide
variant, e.g., from the cell or from the cell culture medium.

In another aspect, the invention features a purified recombinant nucleic acid having
at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a sequence of the
invention contained in the Sequence Listing.

The invention also provides a probe or primer which includes a substantially
purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence
which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense
or antisense sequence of the invention contained in the Sequence Listing, or naturally
occurring mutants thereof. In preferred embodiments, the probe or primer further includes
a label group attached thereto. The label group can be, e.g., a radioisotope, a fluorescent
compound, an enzyme, and/or an enzyme co-factor. Preferably the oligonucleotide is at
least 10 and less than 20, 30, 50, 100, or 150 nucleotides in length.

The invention further provides nucleic acids, e.g., RNA or DNA, encoding a
polypeptide of the invention. This includes double stranded nucleic acids as well as coding
and antisense single strands.

The H. pylori strain from which the genomic sequences were determined has been
deposited in the American Type Culture Collection (ATCC # 55679) as strain HP-J99.

Included in the invention are: allelic variations; natural mutants; induced mutants;
proteins encoded by DNA that hybridizes under high or low stringency conditions to a
nucleic acid which encodes a polypeptide of the invention contained in the Sequence
Listing (for definitions of high and low stringency see Current Protocols in Molecular
Biology, John Wiley & Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by
reference); and polypeptides specifically bound by antisera to H. pylori polypeptides,
especially by antisera to an active site or binding domain of an H. pylori polypeptide. The
invention also includes fragments, preferably biologically active fragments. These and
other polypeptides are also referred to herein as H. pylori polypeptide analogs or variants.

Putative functions have been determined for several of the H. pylori polypeptides of
the invention, as shown in Table 1.

Accordingly, uses of the claimed H. pylori polypeptides in these identified functions
are also within the scope of the invention.

In addition, the present invention encompasses H. pylori polypeptides characterized
as shown in Table 1 below, including: H. pylori cell envelope proteins, H. pylori
periplasmic/secreted proteins, H. pylori cytoplasmic proteins, and other H. pylori surface
and membrane proteins. Members of these groups were identified by BLAST homology
searches and by searches for secretion signal or transmembrane protein motifs.

(Polypeptides in the same row of Table 1, i.e., rows 1 and 3, or rows 2 and 4, are related to
one another as described in Table 3 below.)

TABLE OF FUNCTIONAL GROUPS




                  nt        aa                        nt        aa
ORF Name          SeqID #   SeqID #   ORF Name        SeqID #   SeqID #
Row               1         2                         3         4

A. CELL ENVELOPE
A.1. Flagella-associated
01gp10401orf1     1020      1471      26588588.aa     217       660
01gp10401orf5     1021      1472      26588588.aa     217       660
02ae11612orf21    1036      1487      6288949.aa      367       855
02ce10213orf7     1050      1501      22692187.aa     911       534
02ge20116orf34    1071      1522      29454837.aa     944       675
04ge11713orf5     1101      1552      1171928.aa      18        404
04ge11713orf5     1101      1552      21699087.aa     107       518
05ep20322orf11    1135      1586      16219090.aa     894       464
12ge20305orf11    1276      1727      29298130.aa     943       672
06cp20302orf12    1150      1601      25525277.aa     203       640
07ge20415orf27    1187      1638      19557055.aa     85        490
07ge20415orf27    1187      1638      36111066.aa     290       755
07ge31107orf2     1192      1643      104792.aa       5         389
29zp10241orf6     1361      1812      24882763.aa               635
hp2e10911orf5     1379      1830      917152.aa                 877
hp3e11122orf1                         25478375.aa     934
                  1403      1854      16984442.aa               477
hpe11122orf5      1400      1851      3942217.aa                772
07ge20415orf34    1189      1640      26380318.aa               658


A.2. Inner membrane proteins
                            1453
                  1213      1664      16131887.aa     893       463
                                                      97
                                                      22
                                                      49
                                                                782
                                                                691
                                      14713512.aa     57
                                                      2
                                                      92        497
                            1671      32704686.aa     255       712
                                                                638
                                      34666680.aa
                                                      245       697
                                                                627
06ep10306orf3     1160
                                                      946
                            1751
                                                                551
                                                      76        478
                                                      66
                                      34489543.aa
14gp12015orf16
                                      4698838.aa
                                                                809
                                                      70
                                                                727
                                                                631
                  1044      1495      14645905.aa     55        450
                  1273      1724
                                                      54        448
                                      26261040.aa
A.3. Transporters
                                                      90
                                                      15
                                                      79
                                      30662792.aa     238
                                                      274
14gp12015orf14
                            1782      12617677.aa     27
                            1883      33203192.aa     258
                  1432      1883      36573502.aa
                  1052      1503      1071890.aa      10
                                      24215.aa
04ep10811orf4
                                      289711.aa
                                      24238762.aa
                            1456      1464715.aa      56
                                      4882763.aa
                                                      71
                                                      58
                  1304      1755
                                      26258562.aa
                                                                652
                                                      960
                  1348
                                                      40
                                                                748
                  1053
                                                      11
                                                      974
                                      17787558.aa     78
A.4. Outer membrane proteins
A.5. Other cell envelope proteins

B. CYTOPLASMIC PROTEINS
B.1. Proteins involved in energy conversion
B.2. Proteins involved in amino acid metabolism
01ae12021orf1     997       1448      34109763.aa
B.3. Proteins involved in nucleotide metabolism
B.4. Proteins involved in carbohydrate metabolism
B.5. Proteins involved in cofactor metabolism
B.6. Proteins involved in lipid metabolism
B.7. Proteins involved in mRNA translation and
B.8. Proteins involved in genome replication, transcription, recombination & repair
B.9. Proteins involved in outer membrane or

C. SECRETED OR PERIPLASMIC PROTEINS
C.1. Secreted or periplasmic proteins
C.2. Proteins likely to be secreted or periplasmic

D. OTHER SURFACE AND MEMBRANE PROTEINS
D.1. Proteins likely to contain a single membrane spanning region
D.2. Proteins likely to contain two membrane spans
D.3. Proteins likely to contain 3 membrane spanning regions
                  1295      1746      25398250.aa     200       636


16ep10117orf7     1343      1794      36134661.aa     292       757
hp3e11024orf17    1392      1843      1206675.aa      23        412
hp6p10723orf43    1443      1894      4744128.aa      336       816
D.4. Proteins likely to contain 4 membrane spanning regions
03xe11215orf5     1085      1536      3933437.aa      301       771
04ep71403orf12    1093      1544      12694087.aa     28        418
05ap11505orf10    1117      1568      26758437.aa     941       662
05cp11911orf13    1121      1572      21511555.aa     103       512
05cp11911orf13    1121      1572      29531590.aa     231       679
05cp20518orf51    1131      1582      24409641.aa     176       605
13ee10216orf5     1287      1738      12969218.aa     31        421
13ee10216orf5     1287      1738      23494043.aa     917       552
hp5p15641orf9     1440      1891      23945317.aa     151       576
09ae11601orf11    1209      1660      23867687.aa     922       571
D.5. Proteins likely to contain 5 membrane spanning regions
16ep10117orf6     1342      1793
hp2p10625orf14    1381      1832      33986087.aa     265       725
hp3e10349orf25    1390      1841      23631317.aa     142       565
hp3p10349orf32    1409      1860      33218912.aa     259       717
02ae11612orf14    1035      1486      23437502.aa     122       536
                                      25995917.aa     206       647
                                      11878127.aa     885       409
D.6. Proteins likely to contain 6 membrane spanning regions
01cp11710orf34    1014      1465      2042312.aa      96        501
01cp11710orf34    1014      1465      5083577.aa      353       838
04ep10206orf22    1088      1539      13704718.aa     38        428
04ep10206orf22    1088      1539      20023400.aa     89        494
11ge10309orf18    1242      1693      17089217.aa     77        479
07cp21714orf14    1178      1629      32663212.aa     954       711
07cp21714orf14    1178      1629      3360130.aa      264       722
D.7. Proteins likely to contain 7 or more membrane spanning regions
04ep10206orf23    1089      1540      25976418.aa     937       645
04ep10206orf23    1089      1540      2915903.aa      226       670
16ae10508orf3     1340      1791      35360843.aa     283       746
03ap21820orf13    1074      1525      197166.aa       88        493
03ap21820orf13    1074      1525      234391.aa       125       540
03ap21820orf13    1074      1525      24417212.aa     183       612
05ae20220orf31    1107      1558      24798427.aa     195       629
07gp11807orf49    1204      1655      19531291.aa     81        484
07gp11807orf49    1204      1655      19536375.aa     901       485
02ep30607orf31    1066      1517      19536458.aa     82        486
02ep30607orf31    1066      1517      13865928.aa     42        433



[In Table 1, "nt" represents nucleotide Seq. ID number and "aa" represents amino acid
Seq. ID number]



Definitions 

A purified or isolated polypeptide or a substantially pure preparation of a
polypeptide are used interchangeably herein and, as used herein, mean a polypeptide that
has been separated from other proteins, lipids, and nucleic acids with which it naturally
occurs. Preferably, the polypeptide is also separated from substances, e.g., antibodies or
gel matrix, e.g., polyacrylamide, which are used to purify it. Preferably, the polypeptide
constitutes at least 10, 20, 50, 70, 80 or 95% dry weight of the purified preparation.
Preferably, the preparation contains: sufficient polypeptide to allow protein sequencing; at
least 1, 10, or 100 ng of the polypeptide; at least 1, 10, or 100 mg of the polypeptide.

A purified preparation of cells refers to, in the case of plant or animal cells, an in
vitro preparation of cells and not an entire intact plant or animal. In the case of cultured
cells or microbial cells, it consists of a preparation of at least 10% and more preferably 50%
of the subject cells.

A purified or isolated or a substantially pure nucleic acid, e.g., a substantially pure
DNA, (are terms used interchangeably herein) is a nucleic acid which is one or both of the
following: not immediately contiguous with both of the coding sequences with which it is
immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the naturally
occurring genome of the organism from which the nucleic acid is derived; or which is
substantially free of a nucleic acid with which it occurs in the organism from which the
nucleic acid is derived. The term includes, for example, a recombinant DNA which is
incorporated into a vector, e.g., into an autonomously replicating plasmid or virus, or into
the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule
(e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease
treatment) independent of other DNA sequences. Substantially pure DNA also includes a
recombinant DNA which is part of a hybrid gene encoding additional H. pylori DNA
sequence.

A "contig" as used herein is a nucleic acid representing a continuous stretch of
genomic sequence of an organism.

An "open reading frame", also referred to herein as ORF, is a region of nucleic acid
which encodes a polypeptide. This region may represent a portion of a coding sequence or
a total sequence and can be determined from a stop to stop codon or from a start to stop
codon.
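
The two boundary conventions in the ORF definition above (stop to stop, and start to stop) can be illustrated with a minimal Python sketch. The function names, the restriction to the canonical ATG start codon and the TAA/TAG/TGA stop codons, and the toy sequence are assumptions made for illustration only; a single reading frame of a single strand is scanned, and nothing here is taken from the Sequence Listing.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed: standard stop codons
START_CODON = "ATG"                  # assumed: canonical start codon only


def codons(seq, frame=0):
    """Split seq into consecutive triplets, starting at offset `frame`."""
    return [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]


def stop_to_stop_orfs(seq, frame=0):
    """Return the codon runs lying between consecutive in-frame stop codons."""
    orfs, current, seen_stop = [], [], False
    for codon in codons(seq, frame):
        if codon in STOP_CODONS:
            if seen_stop and current:
                orfs.append("".join(current))
            current, seen_stop = [], True
        else:
            current.append(codon)
    return orfs


def start_to_stop_orfs(seq, frame=0):
    """Return regions running from a start codon through the next in-frame stop."""
    orfs, current = [], None
    for codon in codons(seq, frame):
        if current is None:
            if codon == START_CODON:
                current = [codon]
        else:
            current.append(codon)
            if codon in STOP_CODONS:
                orfs.append("".join(current))
                current = None
    return orfs


if __name__ == "__main__":
    toy = "TAAATGAAATTTTAGCCC"  # hypothetical sequence
    print(stop_to_stop_orfs(toy))
    print(start_to_stop_orfs(toy))
```

A fuller scan would repeat this over all three frames of both strands.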

As used herein, a "coding sequence" is a nucleic acid which is transcribed into
messenger RNA and/or translated into a polypeptide when placed under the control of
appropriate regulatory sequences. The boundaries of the coding sequence are determined
by a translation start codon at the five prime terminus and a translation stop codon at the
three prime terminus. A coding sequence can include but is not limited to messenger RNA,
synthetic DNA, and recombinant nucleic acid sequences.

A "complement" of a nucleic acid as used herein refers to an anti-parallel or
antisense sequence that participates in Watson-Crick base-pairing with the original
sequence.
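
As a small illustration of the complement definition above, the following Python sketch derives the anti-parallel (antisense) strand of a DNA sequence by Watson-Crick pairing. The function names and the restriction to the unambiguous bases A, C, G and T are assumptions for illustration.

```python
# Watson-Crick pairing for DNA (assumed alphabet: A, C, G, T in either case).
_PAIRS = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq):
    """Base-by-base complement, written in the same left-to-right order."""
    return seq.translate(_PAIRS)


def reverse_complement(seq):
    """Anti-parallel complement, i.e. the antisense strand written 5' to 3'."""
    return complement(seq)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATTGCC"))
```

Applying `reverse_complement` twice returns the original sequence, which is a convenient sanity check.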

A "gene product" is a protein or structural RNA which is specifically encoded by a
gene.

As used herein, the term "probe" refers to a nucleic acid, peptide or other chemical
entity which specifically binds to a molecule of interest. Probes are often associated with
or capable of associating with a label. A label is a chemical moiety capable of detection.
Typical labels comprise dyes, radioisotopes, luminescent and chemiluminescent moieties,
fluorophores, enzymes, precipitating agents, amplification sequences, and the like.
Similarly, a nucleic acid, peptide or other chemical entity which specifically binds to a
molecule of interest and immobilizes such molecule is referred to herein as a "capture ligand".
Capture ligands are typically associated with or capable of associating with a support such
as nitro-cellulose, glass, nylon membranes, beads, particles and the like. The specificity of
hybridization is dependent on conditions such as the base pair composition of the
nucleotides, and the temperature and salt concentration of the reaction. These conditions
are readily discernible to one of ordinary skill in the art using routine experimentation.

Homologous refers to the sequence similarity or sequence identity between two
polypeptides or between two nucleic acid molecules. When a position in both of the two
compared sequences is occupied by the same base or amino acid monomer subunit, e.g., if
a position in each of two DNA molecules is occupied by adenine, then the molecules are
homologous at that position. The percent of homology between two sequences is a
function of the number of matching or homologous positions shared by the two sequences
divided by the number of positions compared x 100. For example, if 6 of 10 of the
positions in two sequences are matched or homologous then the two sequences are 60%
homologous. By way of example, the DNA sequences ATTGCC and TATGGC share 50%
homology. Generally, a comparison is made when two sequences are aligned to give
maximum homology.
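
The percent homology calculation described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function name is hypothetical, the two sequences are taken to be already aligned and of equal length, and the alignment step that maximizes homology is not performed.

```python
def percent_homology(seq_a, seq_b):
    """Matching positions divided by positions compared, times 100.

    Assumes seq_a and seq_b are already aligned and of equal length;
    no gaps are introduced and no alignment search is done.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # The example pair from the text: 3 of 6 positions match.
    print(percent_homology("ATTGCC", "TATGGC"))
```

For the pair given in the text, positions 3, 4 and 6 match, so 3 of 6 positions are homologous, in agreement with the stated 50%.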

Nucleic acids are hybridizable to each other when at least one strand of a nucleic
acid can anneal to the other nucleic acid under defined stringency conditions. Stringency of
hybridization is determined by: (a) the temperature at which hybridization and/or washing
is performed; and (b) the ionic strength and polarity of the hybridization and washing
solutions. Hybridization requires that the two nucleic acids contain complementary
sequences; depending on the stringency of hybridization, however, mismatches may be
tolerated. Typically, hybridization of two sequences at high stringency (such as, for
example, in a solution of 0.5X SSC, at 65° C) requires that the sequences be essentially
completely homologous. Conditions of intermediate stringency (such as, for example, 2X
SSC at 65° C) and low stringency (such as, for example, 2X SSC at 55° C), require
correspondingly less overall complementarity between the hybridizing sequences. (1X
SSC is 0.15 M NaCl, 0.015 M Na citrate).
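
The salt concentrations implied by the example stringency conditions above follow by simple scaling from the 1X SSC composition (0.15 M NaCl, 0.015 M Na citrate). The Python sketch below works through that arithmetic; the names `SSC_1X`, `ssc_molarity` and `EXAMPLE_CONDITIONS` are hypothetical, and the three conditions are only the examples given in the text, not a complete definition of stringency.

```python
# 1X SSC composition in mol/L, as stated in the text.
SSC_1X = {"NaCl": 0.15, "Na citrate": 0.015}

# Example conditions from the text: (SSC fold concentration, temperature in deg C).
EXAMPLE_CONDITIONS = {
    "high": (0.5, 65),
    "intermediate": (2.0, 65),
    "low": (2.0, 55),
}


def ssc_molarity(fold):
    """Scale the 1X SSC composition to the given fold concentration."""
    return {salt: fold * conc for salt, conc in SSC_1X.items()}


if __name__ == "__main__":
    for name, (fold, temp_c) in EXAMPLE_CONDITIONS.items():
        salts = ssc_molarity(fold)
        print(f"{name}: {fold}X SSC at {temp_c} C -> "
              f"{salts['NaCl']:.3f} M NaCl, {salts['Na citrate']:.4f} M Na citrate")
```

Lower salt and higher temperature both raise stringency, which is why the high-stringency example pairs the most dilute SSC with the higher temperature.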

The terms peptides, proteins, and polypeptides are used interchangeably herein. 
As used herein, the term "surface protein" refers to all surface accessible proteins, 
e.g. inner and outer membrane proteins, proteins adhering to the cell wall, and secreted 
proteins. 

A polypeptide has H. pylori biological activity if it has one, two and preferably
more of the following properties: (1) when expressed in the course of an H. pylori
infection, it can promote or mediate the attachment of H. pylori to a cell; (2) it has an
enzymatic activity, structural or regulatory function characteristic of an H. pylori protein;
or (3) the gene which encodes it can rescue a lethal mutation in an H. pylori gene. A
polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a
polypeptide having one of the above-listed properties.

A biologically active fragment or analog is one having an in vivo or in vitro activity
which is characteristic of the H. pylori polypeptides of the invention contained in the
Sequence Listing, or of other naturally occurring H. pylori polypeptides, e.g., one or more
of the biological activities described herein. Especially preferred are fragments which exist
in vivo, e.g., fragments which arise from post-transcriptional processing or which arise from
translation of alternatively spliced RNAs. Fragments include those expressed in native or
endogenous cells as well as those made in expression systems, e.g., in CHO cells. Because
peptides such as H. pylori polypeptides often exhibit a range of physiological properties
and because such properties may be attributable to different portions of the molecule, a
useful H. pylori fragment or H. pylori analog is one which exhibits a biological activity in
any biological assay for H. pylori activity. Most preferably the fragment or analog
possesses 10%, preferably 40%, more preferably 60%, 70%, 80% or 90% or greater of the
activity of H. pylori, in any in vivo or in vitro assay.

Analogs can differ from naturally occurring H. pylori polypeptides in amino acid
sequence or in ways that do not involve sequence, or both. Non-sequence modifications
include changes in acetylation, methylation, phosphorylation, carboxylation, or
glycosylation. Preferred analogs include H. pylori polypeptides (or biologically active
fragments thereof) whose sequences differ from the wild-type sequence by one or more
conservative amino acid substitutions or by one or more non-conservative amino acid
substitutions, deletions, or insertions which do not substantially diminish the biological
activity of the H. pylori polypeptide. Conservative substitutions typically include the
substitution of one amino acid for another with similar characteristics, e.g., substitutions
within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine;
aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and
phenylalanine, tyrosine. Other conservative substitutions can be made in view of the table
below.



TABLE 2
CONSERVATIVE AMINO ACID REPLACEMENTS

For Amino Acid    Code    Replace with any of
Alanine           A       D-Ala, Gly, beta-Ala, L-Cys, D-Cys
Arginine          R       D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, Ile,
                          D-Met, D-Ile, Orn, D-Orn
Asparagine        N       D-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln
Aspartic Acid     D       D-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln
Cysteine          C       D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr
Glutamine         Q       D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp
Glutamic Acid     E       D-Glu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln
Glycine           G       Ala, D-Ala, Pro, D-Pro, β-Ala, Acp
Isoleucine        I       D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met
Leucine           L       D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met
Lysine            K       D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met,
                          Ile, D-Ile, Orn, D-Orn
Methionine        M       D-Met, S-Me-Cys, Ile, D-Ile, Leu, D-Leu, Val, D-Val
Phenylalanine     F       D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp,
                          Trans-3,4, or 5-phenylproline, cis-3,4,
                          or 5-phenylproline
Proline           P       D-Pro, L-I-thioazolidine-4-carboxylic acid, D- or L-1-
                          oxazolidine-4-carboxylic acid
Serine            S       D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O),
                          D-Met(O), L-Cys, D-Cys
Threonine         T       D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O),
                          D-Met(O), Val, D-Val
Tyrosine          Y       D-Tyr, Phe, D-Phe, L-Dopa, His, D-His
Valine            V       D-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met
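
The substitution groups named in the text preceding Table 2 (valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; phenylalanine, tyrosine) lend themselves to a simple lookup. The Python sketch below is illustrative only: the function names and the toy sequences are hypothetical, only those eight groups are encoded (not the broader replacements of Table 2), and the two sequences are assumed to be aligned and of equal length.

```python
# The eight groups listed in the text, as one-letter codes.
CONSERVATIVE_GROUPS = [
    {"V", "G"}, {"G", "A"}, {"V", "I", "L"}, {"D", "E"},
    {"N", "Q"}, {"S", "T"}, {"K", "R"}, {"F", "Y"},
]


def is_conservative(a, b):
    """True if a -> b is a substitution within one of the listed groups."""
    a, b = a.upper(), b.upper()
    return a != b and any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(wild_type, variant):
    """List (position, wild-type residue, variant residue, is_conservative).

    Positions are 1-based; the sequences are assumed aligned, equal length.
    """
    if len(wild_type) != len(variant):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (pos, w, v, is_conservative(w, v))
        for pos, (w, v) in enumerate(zip(wild_type, variant), start=1)
        if w != v
    ]


if __name__ == "__main__":
    # Hypothetical four-residue peptides differing at positions 2 and 4.
    print(classify_substitutions("MKVD", "MRVK"))
```

Whether a given substitution preserves biological activity still has to be established by assay; the lookup only reports group membership.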



Other analogs within the invention are those with modifications which increase
peptide stability; such analogs may contain, for example, one or more non-peptide bonds
(which replace the peptide bonds) in the peptide sequence. Also included are: analogs that
include residues other than naturally occurring L-amino acids, e.g., D-amino acids or
non-naturally occurring or synthetic amino acids, e.g., β or γ amino acids; and cyclic analogs.

As used herein, the term "fragment", as applied to an H. pylori analog, will
ordinarily be at least about 20 residues, more typically at least about 40 residues, preferably
at least about 60 residues in length. Fragments of H. pylori polypeptides can be generated
by methods known to those skilled in the art. The ability of a candidate fragment to exhibit
a biological activity of H. pylori polypeptide can be assessed by methods known to those
skilled in the art as described herein. Also included are H. pylori polypeptides containing
residues that are not required for biological activity of the peptide or that result from
alternative mRNA splicing or alternative protein processing events.

An "immunogenic component" as used herein is a moiety, such as an H. pylori
polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or
cellular immune response in a host animal.

An "antigenic component" as used herein is a moiety, such as an H. pylori
polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody
with sufficiently high affinity to form a detectable antigen-antibody complex.

As used herein, the term "transgene" means a nucleic acid (encoding, e.g., one or
more polypeptides), which is partly or entirely heterologous, i.e., foreign, to the transgenic
animal or cell into which it is introduced, or, is homologous to an endogenous gene of the
transgenic animal or cell into which it is introduced, but which is designed to be inserted, or
is inserted, into the cell's genome in such a way as to alter the genome of the cell into which
it is inserted (e.g., it is inserted at a location which differs from that of the natural gene or
its insertion results in a knockout). A transgene can include one or more transcriptional
regulatory sequences and any other nucleic acid, such as introns, that may be necessary for
optimal expression of the selected nucleic acid, all operably linked to the selected nucleic
acid, and may include an enhancer sequence.

As used herein, the term "transgenic cell" refers to a cell containing a transgene.

As used herein, a "transgenic animal" is any animal in which one or more, and
preferably essentially all, of the cells of the animal include a transgene. The transgene can
be introduced into the cell, directly or indirectly by introduction into a precursor of the cell,
by way of deliberate genetic manipulation, such as by a process of transformation of
competent cells or by microinjection or by infection with a recombinant virus. This
molecule may be integrated within a chromosome, or it may be extrachromosomally
replicating DNA.

The term "antibody" as used herein is intended to include fragments thereof which
are specifically reactive with H. pylori polypeptides.

As used herein, the term "cell-specific promoter" means a DNA sequence that 
serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 
linked to the promoter, and which effects expression of the selected DNA sequence in 
specific cells of a tissue. The term also covers so-called "leaky" promoters, which
regulate expression of a selected DNA primarily in one tissue, but cause expression in
other tissues as well. 

Misexpression, as used herein, refers to a non-wild type pattern of gene expression.
It includes: expression at non-wild type levels, i.e., over or under expression; a pattern of
expression that differs from wild type in terms of the time or stage at which the gene is
expressed, e.g., increased or decreased expression (as compared with wild type) at a
predetermined developmental period or stage; a pattern of expression that differs from wild
type in terms of decreased expression (as compared with wild type) in a predetermined cell
type or tissue type; a pattern of expression that differs from wild type in terms of the
splicing size, amino acid sequence, post-translational modification, or biological activity of
the expressed polypeptide; a pattern of expression that differs from wild type in terms of
the effect of an environmental stimulus or extracellular stimulus on expression of the gene,
e.g., a pattern of increased or decreased expression (as compared with wild type) in the
presence of an increase or decrease in the strength of the stimulus.

As used herein, "host cells" and other such terms denoting microorganisms or
higher eukaryotic cell lines cultured as unicellular entities refer to cells which can become
or have been used as recipients for a recombinant vector or other transfer DNA, and include
the progeny of the original cell which has been transfected. It is understood by individuals
skilled in the art that the progeny of a single parental cell may not necessarily be
completely identical in genomic or total DNA complement to the original parent, due to
accidental or deliberate mutation.

As used herein, the term "control sequence" refers to a nucleic acid having a base
sequence which is recognized by the host organism to effect the expression of encoded
sequences to which it is ligated. The nature of such control sequences differs depending
upon the host organism; in prokaryotes, such control sequences generally include a
promoter, ribosomal binding site, terminators, and in some cases operators; in eukaryotes,
generally such control sequences include promoters, terminators and in some instances,
enhancers. The term control sequence is intended to include, at a minimum, all components
whose presence is necessary for expression, and may also include additional components
whose presence is advantageous, for example, leader sequences.

As used herein, the term "operably linked" refers to sequences joined or ligated to
function in their intended manner. For example, a control sequence is operably linked to
coding sequence by ligation in such a way that expression of the coding sequence is 
achieved under conditions compatible with the control sequence and host cell. 

The metabolism of a substance, as used herein, means any aspect of the expression,
function, action, or regulation of the substance. The metabolism of a substance includes
modifications, e.g., covalent or non-covalent modifications, of the substance. The
metabolism of a substance includes modifications, e.g., covalent or non-covalent
modifications, that the substance induces in other substances. The metabolism of a substance
also includes changes in the distribution of the substance. The metabolism of a substance
includes changes the substance induces in the distribution of other substances.

A "sample" as used herein refers to a biological sample, such as, for example, tissue
or fluid isolated from an individual (including without limitation plasma, serum,
cerebrospinal fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture
constituents, as well as samples from the environment.

The practice of the invention will employ, unless otherwise indicated, conventional 
techniques of chemistry, molecular biology, microbiology, recombinant DNA, and 
immunology, which are within the skill of the art. Such techniques are explained fully in 
the literature. See, e.g., Sambrook, Fritsch, and Maniatis, Molecular Cloning: A Laboratory
Manual, 2nd ed. (1989); DNA Cloning, Volumes I and II (D.N. Glover ed. 1985);
Oligonucleotide Synthesis (M.J. Gait ed., 1984); Nucleic Acid Hybridization (B.D. Hames &
S.J. Higgins eds. 1984); the series, Methods in Enzymology (Academic Press, Inc.),
particularly Vol. 154 and Vol. 155 (Wu and Grossman, eds.) and PCR: A Practical
Approach (McPherson, Quirke, and Taylor, eds., 1991).


I. Isolation of Nucleic Acids of H. pylori and Uses Therefor

H. pylori Genomic Sequence 

This invention provides nucleotide sequences of the genome of H. pylori, which thus
constitute a DNA sequence library of H. pylori genomic DNA. The detailed description
that follows provides nucleotide sequences of H. pylori, and also describes how the
sequences were obtained and how ORFs and protein-coding sequences were identified.
Also described are methods of using the disclosed H. pylori sequences, including
diagnostic and therapeutic applications. Furthermore, the library can be used as a
database for identification and comparison of medically important sequences in this and
other strains of H. pylori.

To determine the genomic sequence of H. pylori, DNA was isolated from a strain of
H. pylori (ATCC # 55679) and mechanically sheared by nebulization to a median size of 2
kb. Following size fractionation by gel electrophoresis, the fragments were blunt-ended,
ligated to adapter oligonucleotides, and cloned into each of 20 different pMPX vectors
(Rice et al., abstracts of Meeting of Genome Mapping and Sequencing, Cold Spring
Harbor, NY, 5/11-5/15, 1994, p. 225) to construct a series of "shotgun" subclone libraries.

DNA sequencing was achieved using multiplex sequencing procedures essentially
as disclosed (Church et al., 1988, Science 240:185; U.S. Patent Nos. 4,942,124 and
5,149,625). DNA was extracted from pooled cultures and subjected to chemical or
enzymatic sequencing. Sequencing reactions were resolved by electrophoresis, and the
products were transferred and covalently bound to nylon membranes. Finally, the
membranes were sequentially hybridized with a series of labelled oligonucleotides
complementary to "tag" sequences present in the different shotgun cloning vectors. In this
manner, a large number of sequences could be obtained from a single set of sequencing
reactions. The cloning and sequencing procedures are described in more detail in the
Exemplification.

Individual sequence reads obtained in this manner were assembled using the
FALCON™ program (Church et al., 1994, Automated DNA Sequencing and Analysis, J.C.
Venter, ed., Academic Press) and PHRAP (P. Green, Abstracts of DOE Human Genome 
Program Contractor-Grantee Workshop V, Jan. 1996, p. 157). The average contig length 
was about 3-4 kb. 

A variety of approaches are used to order the contigs so as to obtain a continuous
sequence representing the entire H. pylori genome. Synthetic oligonucleotides are designed
that are complementary to sequences at the end of each contig. These oligonucleotides may
be hybridized to libraries of H. pylori genomic DNA in, for example, lambda phage vectors
or plasmid vectors to identify clones that contain sequences corresponding to the junctional
regions between individual contigs. Such clones are then used to isolate template DNA and
the same oligonucleotides are used as primers in polymerase chain reaction (PCR) to
amplify junctional fragments, the nucleotide sequence of which is then determined.

The H. pylori sequences were analyzed for the presence of open reading frames
(ORFs) comprising at least 180 nucleotides. Because the ORFs were identified from
stop-to-stop codon reads, it should be understood that these ORFs may not correspond to
the ORF of a naturally-occurring H. pylori polypeptide. These ORFs may contain start
codons which indicate the initiation of protein synthesis of a naturally-occurring H.
pylori polypeptide. Such start codons within the ORFs provided herein can be identified by
those of ordinary skill in the relevant art, and the resulting ORF and the encoded H. pylori
polypeptide are within the scope of this invention. For example, within the ORFs a codon
such as AUG or GUG (encoding methionine or valine) which is part of the initiation signal
for protein synthesis can be identified and the ORF modified to correspond to a naturally-
occurring H. pylori polypeptide. The predicted coding regions were defined by evaluating
the coding potential of such sequences with the program GENEMARK™ (Borodovsky and
McIninch, 1993, Comp. Chem. 17:123).
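
The stop-to-stop ORF scan and the subsequent search for an in-frame AUG or GUG codon can be illustrated with a minimal sketch. It is not the procedure actually used for the disclosed sequences (which relied on GENEMARK for coding potential); the function names, the DNA alphabet (ATG/GTG in place of AUG/GUG), and the decision to treat the beginning of the sequence as an open boundary are assumptions made for illustration only.

```python
STOPS = {"TAA", "TAG", "TGA"}      # standard stop codons, written as DNA
STARTS = {"ATG", "GTG"}            # DNA equivalents of AUG and GUG
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement, for scanning the opposite strand."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def stop_to_stop_orfs(seq, min_len=180):
    """Yield (frame, start, end) for open stretches of at least min_len
    nucleotides that end at an in-frame stop codon on the given strand.
    'end' is the index of the first base of the terminating stop codon."""
    seq = seq.upper()
    for frame in range(3):
        orf_start = frame          # assumption: sequence start opens a stretch
        for i in range(frame, len(seq) - 2, 3):
            if seq[i:i + 3] in STOPS:
                if i - orf_start >= min_len:
                    yield frame, orf_start, i
                orf_start = i + 3  # next stretch begins after this stop


def first_start_codon(seq, start, end):
    """Return the index of the first in-frame ATG/GTG in [start, end), or None."""
    seq = seq.upper()
    for i in range(start, end, 3):
        if seq[i:i + 3] in STARTS:
            return i
    return None
```

To cover all six reading frames, the same scan can be run a second time on `reverse_complement(seq)`.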

Other H. pylori Nucleic Acids

The nucleic acids of this invention may be obtained directly from the DNA of the
above referenced H. pylori strain by using the polymerase chain reaction (PCR). See "PCR:
A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford, UK,
1991) for details about the PCR. High fidelity PCR can be used to ensure a faithful DNA
copy prior to expression. In addition, the authenticity of amplified products can be checked
by conventional sequencing methods. Clones carrying the desired sequences described in
this invention may also be obtained by screening the libraries by means of the PCR or by
hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or
plaques as known in the art (see, e.g., Sambrook et al., Molecular Cloning, A Laboratory
Manual, 2nd edition, 1989, Cold Spring Harbor Press, NY).

It is also possible to obtain nucleic acids encoding H. pylori polypeptides from a
cDNA library in accordance with protocols herein described. A cDNA encoding an H.
pylori polypeptide can be obtained by isolating total mRNA from an appropriate strain.
Double stranded cDNAs can then be prepared from the total mRNA. Subsequently, the
cDNAs can be inserted into a suitable plasmid or viral (e.g., bacteriophage) vector using
any one of a number of known techniques. Genes encoding H. pylori polypeptides can also
be cloned using established polymerase chain reaction techniques in accordance with the
nucleotide sequence information provided by the invention. The nucleic acids of the
invention can be DNA or RNA. Preferred nucleic acids of the invention are contained in
the Sequence Listing.

The nucleic acids of the invention can also be chemically synthesized using
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides
are known, including solid-phase synthesis which, like peptide synthesis, has been fully
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. Patent
No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos.
4,401,796 and 4,373,071, incorporated by reference herein).

Nucleic acids isolated or synthesized in accordance with features of the present
invention are useful, by way of example, without limitation, as probes, primers, capture
ligands, antisense genes and for developing expression systems for the synthesis of proteins
and peptides corresponding to such sequences. As probes, primers, capture ligands and
antisense agents, the nucleic acid normally consists of all or part (approximately twenty or
more nucleotides for specificity as well as the ability to form stable hybridization products)
of the nucleic acids of the invention contained in the Sequence Listing. These uses are
described in further detail below.

Probes 

A nucleic acid isolated or synthesized in accordance with the sequence of the
invention contained in the Sequence Listing can be used as a probe to specifically detect H.
pylori. With the sequence information set forth in the present application, sequences of
twenty or more nucleotides are identified which provide the desired inclusivity and
exclusivity with respect to H. pylori and to extraneous nucleic acids likely to be encountered
under hybridization conditions. More preferably, the sequence will comprise at least
twenty to thirty nucleotides to convey stability to the hybridization product formed between
the probe and the intended target molecules.
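
As a rough illustration of inclusivity and exclusivity, the following sketch lists subsequences of a chosen length (twenty nucleotides by default) that occur in a target sequence but in neither strand of a set of non-target sequences. It is a simplification under stated assumptions: only exact matches are considered, whereas real hybridization tolerates mismatches depending on stringency, and the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def kmers(seq, k):
    """Return the set of all length-k subsequences of seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def candidate_probes(target, non_targets, k=20):
    """Return sorted k-mers found in the target (inclusivity) that are
    absent from both strands of every non-target sequence (exclusivity)."""
    excluded = set()
    for other in non_targets:
        excluded |= kmers(other, k)
        excluded |= kmers(reverse_complement(other), k)
    return sorted(kmers(target, k) - excluded)
```

Candidates returned by such a screen would still need to be checked experimentally under the hybridization conditions actually used.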

Sequences larger than 1000 nucleotides in length are difficult to synthesize but can
be generated by recombinant DNA techniques. Individuals skilled in the art will readily
recognize that the nucleic acids, for use as probes, can be provided with a label to facilitate
detection of a hybridization product.

Nucleic acid isolated or synthesized in accordance with the sequence of the
invention contained in the Sequence Listing can also be useful as probes to detect
homologous regions (especially homologous genes) of other Helicobacter species using
appropriate stringency hybridization conditions as described herein.

Capture Ligand

For use as a capture ligand, the nucleic acid selected in the manner described above
with respect to probes can be readily associated with a support. The manner in which
nucleic acid is associated with supports is well known. Nucleic acid having twenty or more
nucleotides in a sequence of the invention contained in the Sequence Listing has utility to
separate H. pylori nucleic acids from one another and from the nucleic acid of other organisms.
Nucleic acid having twenty or more nucleotides in a sequence of the invention contained in
the Sequence Listing can also have utility to separate other Helicobacter species from each
other and from other organisms. Preferably, the sequence will comprise at least twenty
nucleotides to convey stability to the hybridization product formed between the probe and
the intended target molecules. Sequences larger than 1000 nucleotides in length are
difficult to synthesize but can be generated by recombinant DNA techniques.

Primers

Nucleic acid isolated or synthesized in accordance with the sequences described
herein has utility as primers for the amplification of H. pylori nucleic acid. These nucleic
acids may also have utility as primers for the amplification of nucleic acids in other
Helicobacter species. With respect to polymerase chain reaction (PCR) techniques, nucleic
acid sequences of > 10-15 nucleotides of the invention contained in the Sequence Listing
have utility in conjunction with suitable enzymes and reagents to create copies of H. pylori
nucleic acid. More preferably, the sequence will comprise twenty or more nucleotides to
convey stability to the hybridization product formed between the primer and the intended
target molecules. Binding conditions of primers greater than 100 nucleotides are more
difficult to control to obtain specificity. High fidelity PCR can be used to ensure a faithful
DNA copy prior to expression. In addition, amplified products can be checked by
conventional sequencing methods.
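
The relationship between a primer pair and the copy it is expected to produce can be sketched in silico. The example below assumes exact primer matches on a single linear template and returns the first product found; the function name and the simplifications are illustrative assumptions, not a description of any particular PCR protocol.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def in_silico_pcr(template, forward, reverse):
    """Return the expected amplicon (top strand, primers included), or None.

    The forward primer is searched on the given strand. The reverse primer
    anneals to the opposite strand, so its reverse complement is searched
    downstream of the forward primer site."""
    template = template.upper()
    forward = forward.upper()
    start = template.find(forward)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]
```

The returned string can then be compared with the sequence of the actual amplified product, in the spirit of the sequencing check mentioned above.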

The copies can be used in diagnostic assays to detect specific sequences, including 
genes from H. pylori and/or other Helicobacter species. The copies can also be 
incorporated into cloning and expression vectors to generate polypeptides corresponding to 
the nucleic acid synthesized by PCR, as is described in greater detail herein.

Antisense 

Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in
accordance with the sequences described herein have utility as antisense agents to prevent
the expression of H. pylori genes. These sequences also have utility as antisense agents to
prevent expression of genes of other Helicobacter species.

In one embodiment, nucleic acid or derivatives corresponding to H. pylori nucleic
acids is loaded into a suitable carrier such as a liposome or bacteriophage for introduction
into bacterial cells. For example, a nucleic acid having twenty or more nucleotides is
capable of binding to bacterial nucleic acid or bacterial messenger RNA. Preferably, the
antisense nucleic acid is comprised of 20 or more nucleotides to provide necessary stability
of a hybridization product of non-naturally occurring nucleic acid and bacterial nucleic acid
and/or bacterial messenger RNA. Nucleic acid having a sequence greater than 1000
nucleotides in length is difficult to synthesize but can be generated by recombinant DNA
techniques. Methods for loading antisense nucleic acid in liposomes are known in the art as
exemplified by U.S. Patent 4,241,046 issued December 23, 1980 to Papahadjopoulos et al.

II. Expression of H. pylori Nucleic Acids

Nucleic acid isolated or synthesized in accordance with the sequences described
herein has utility to generate polypeptides. The nucleic acid of the invention exemplified
in the Sequence Listing or fragments of said nucleic acid encoding active portions of H.
pylori polypeptides can be cloned into suitable vectors or used to isolate nucleic acid. The
isolated nucleic acid is combined with suitable DNA linkers and cloned into a suitable
vector.

The function of a specific gene or operon can be ascertained by expression in a
bacterial strain under conditions where the activity of the gene product(s) specified by the
gene or operon in question can be specifically measured. Alternatively, a gene product may
be produced in large quantities in an expressing strain for use as an antigen, an industrial
reagent, for structural studies, etc. This expression can be accomplished in a mutant strain
which lacks the activity of the gene to be tested, or in a strain that does not produce the
same gene product(s). This includes, but is not limited to, other Helicobacter strains, or
other bacterial strains such as E. coli, Nocardia, Corynebacterium, Campylobacter, and
Streptomyces species. In some cases the expression host will utilize the natural
Helicobacter promoter whereas in others, it will be necessary to drive the gene with a
promoter sequence derived from the expressing organism (e.g., an E. coli
beta-galactosidase promoter for expression in E. coli).

To express a gene product using the natural H. pylori promoter, a procedure such as
the following can be used. A restriction fragment containing the gene of interest, together
with its associated natural promoter element and regulatory sequences (identified using the
DNA sequence data) is cloned into an appropriate recombinant plasmid containing an
origin of replication that functions in the host organism and an appropriate selectable
marker. This can be accomplished by a number of procedures known to those skilled in the
art. It is most preferably done by cutting the plasmid and the fragment to be cloned with
the same restriction enzyme to produce compatible ends that can be ligated to join the two
pieces together. The recombinant plasmid is introduced into the host organism by, for
example, electroporation and cells containing the recombinant plasmid are identified by
selection for the marker on the plasmid. Expression of the desired gene product is detected
using an assay specific for that gene product.

In the case of a gene that requires a different promoter, the body of the gene (coding
sequence) is specifically excised and cloned into an appropriate expression plasmid. This
subcloning can be done by several methods, but is most easily accomplished by PCR
amplification of a specific fragment and ligation into an expression plasmid after treating
the PCR product with a restriction enzyme or exonuclease to create suitable ends for
cloning.

A suitable host cell for expression of a gene can be any procaryotic or eucaryotic
cell. For example, an H. pylori polypeptide can be expressed in bacterial cells such as E.
coli, insect cells (baculovirus), yeast, or mammalian cells such as Chinese hamster ovary
cells (CHO). Other suitable host cells are known to those skilled in the art.

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead to
partial or complete glycosylation and/or formation of relevant inter- or intra-chain disulfide
bonds of a recombinant peptide product. Examples of vectors for expression in yeast
S. cerevisiae include pYepSec1 (Baldari et al., (1987) EMBO J. 6:229-234), pMFa (Kurjan
and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:113-123),
and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors available
for expression of proteins in cultured insect cells (SF 9 cells) include the pAc series (Smith
et al., (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow, V.A., and
Summers, M.D., (1989) Virology 170:31-39). Generally, COS cells (Gluzman, Y., (1981)
Cell 23:175-182) are used in conjunction with such vectors as pCDM8 (Aruffo, A. and
Seed, B., (1987) Proc. Natl. Acad. Sci. USA 84:8573-8577) for transient
amplification/expression in mammalian cells, while CHO (dhfr Chinese Hamster Ovary)
cells are used with vectors such as pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195)
for stable amplification/expression in mammalian cells. Vector DNA can be introduced
into mammalian cells via conventional techniques such as calcium phosphate or calcium
chloride co-precipitation, DEAE-dextran-mediated transfection, or electroporation.
Suitable methods for transforming host cells can be found in Sambrook et al. (Molecular
Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press
(1989)), and other laboratory textbooks.

Expression in procaryotes is most often carried out in E. coli with either fusion or
non-fusion inducible expression vectors. Fusion vectors usually add a number of
NH2-terminal amino acids to the expressed target gene. These NH2-terminal amino acids often
are referred to as a reporter group. Such reporter groups usually serve two purposes: 1) to
increase the solubility of the target recombinant protein; and 2) to aid in the purification of
the target recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the reporter
group and the target recombinant protein to enable separation of the target recombinant
protein from the reporter group subsequent to purification of the fusion protein. Such
enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and
enterokinase. Typical fusion expression vectors include pGEX (Amrad Corp., Melbourne,
Australia), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia,
Piscataway, NJ) which fuse glutathione S-transferase, maltose E binding protein, or protein
A, respectively, to the target recombinant protein. A preferred reporter group is poly(His),
which may be fused to the amino or carboxy terminus of the protein and which renders the
recombinant fusion protein easily purifiable by metal chelate chromatography.

Inducible non-fusion expression vectors include pTrc (Amann et al., (1988) Gene
69:301-315) and pET11d (Studier et al., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 60-89). While target gene
expression relies on host RNA polymerase transcription from the hybrid trp-lac fusion
promoter in pTrc, expression of target genes inserted into pET11d relies on transcription
from the T7 gn10-lac 0 fusion promoter mediated by coexpressed viral RNA polymerase
(T7 gn1). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3)
from a resident λ prophage harboring a T7 gn1 under the transcriptional control of the
lacUV5 promoter.

For example, a host cell transfected with a nucleic acid vector directing expression
of a nucleotide sequence encoding an H. pylori polypeptide can be cultured under
appropriate conditions to allow expression of the polypeptide to occur. The polypeptide
may be secreted and isolated from a mixture of cells and medium containing the peptide.
Alternatively, the polypeptide may be retained cytoplasmically and the cells harvested,
lysed and the protein isolated. A cell culture includes host cells, media and other
byproducts. Suitable media for cell culture are well known in the art. Polypeptides of the
invention can be isolated from cell culture medium, host cells, or both using techniques
known in the art for purifying proteins including ion-exchange chromatography, gel
filtration chromatography, ultrafiltration, electrophoresis, and immunoaffinity purification
with antibodies specific for such polypeptides. Additionally, in many situations,
polypeptides can be produced by chemical cleavage of a native protein (e.g., tryptic
digestion) and the cleavage products can then be purified by standard techniques.

In the case of membrane bound proteins, these can be isolated from a host cell by
contacting a membrane-associated protein fraction with a detergent forming a solubilized
complex, where the membrane-associated protein is no longer entirely embedded in the
membrane fraction and is solubilized at least to an extent which allows it to be
chromatographically isolated from the membrane fraction. Several different criteria are
used for choosing a detergent suitable for solubilizing these complexes. For example, one
property considered is the ability of the detergent to solubilize the H. pylori protein within
the membrane fraction with minimal denaturation of the membrane-associated protein,
allowing for the activity or functionality of the membrane-associated protein to return upon
reconstitution of the protein. Another property considered when selecting the detergent is
the critical micelle concentration (CMC) of the detergent, in that the detergent of choice
preferably has a high CMC value allowing for ease of removal after reconstitution. A third
property considered when selecting a detergent is the hydrophobicity of the detergent.
Typically, membrane-associated proteins are very hydrophobic and therefore detergents
which are also hydrophobic, e.g., the Triton series, would be useful for solubilizing the
hydrophobic proteins. Another property important to a detergent can be the capability of
the detergent to remove the H. pylori protein with minimal protein-protein interaction,
facilitating further purification. A fifth property of the detergent which should be
considered is the charge of the detergent. For example, if it is desired to use ion exchange
resins in the purification process then preferably the detergent should be an uncharged
detergent. Chromatographic techniques which can be used in the final purification step are
known in the art and include hydrophobic interaction, lectin affinity, ion exchange, dye
affinity and immunoaffinity.

One strategy to maximize recombinant H. pylori peptide expression in E. coli is to
express the protein in a host bacteria with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another
strategy would be to alter the nucleic acid encoding an H. pylori peptide to be inserted into
an expression vector so that the individual codons for each amino acid would be those
preferentially utilized in highly expressed E. coli proteins (Wada et al., (1992) Nuc. Acids
Res. 20:2111-2118). Such alteration of nucleic acids of the invention can be carried out by
standard DNA synthesis techniques.
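
The codon-alteration strategy can be sketched as a synonymous recoding step: each codon is replaced by the preferred codon for the same amino acid, leaving the encoded polypeptide unchanged. The sketch below uses the standard genetic code; the small `PREFERRED` table is an illustrative placeholder covering only three amino acids, not the usage data of Wada et al., and a real application would supply a complete table derived from highly expressed E. coli genes.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}

# Illustrative placeholder only: one assumed preferred codon for three residues.
PREFERRED = {"L": "CTG", "R": "CGT", "K": "AAA"}


def translate(cds):
    """Translate a coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds, preferred=PREFERRED):
    """Replace each codon with the preferred synonymous codon, if one is listed.

    Codons whose amino acid has no entry in `preferred` are left unchanged."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        out.append(preferred.get(CODON_TABLE[codon], codon))
    return "".join(out)
```

Because only synonymous codons are substituted, `translate(recode(cds))` equals `translate(cds)` for any in-frame coding sequence, provided every entry in the supplied table really encodes the amino acid it is listed under.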

The nucleic acids of the invention can also be chemically synthesized using
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides
are known, including solid-phase synthesis which, like peptide synthesis, has been fully
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. Patent
No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos.
4,401,796 and 4,373,071, incorporated by reference herein).

III. H. pylori Polypeptides 

This invention encompasses isolated H. pylori polypeptides encoded by the
disclosed H. pylori genomic sequences, including the polypeptides of the invention
contained in the Sequence Listing. Polypeptides of the invention are preferably at least 5
amino acid residues in length. Using the DNA sequence information provided herein, the
amino acid sequences of the polypeptides encompassed by the invention can be deduced
using methods well-known in the art. It will be understood that the sequence of an entire
nucleic acid encoding an H. pylori polypeptide can be isolated and identified based on an
ORF that encodes only a fragment of the cognate protein-coding region. This can be
achieved, for example, by using the isolated nucleic acid encoding the ORF, or fragments
thereof, to prime a polymerase chain reaction with genomic H. pylori DNA as template;
this is followed by sequencing the amplified product.
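
Deducing an amino acid sequence from a DNA sequence amounts to reading the coding strand three nucleotides at a time against the genetic code. A minimal sketch follows, assuming the standard genetic code and an in-frame coding sequence. The `initiator` flag, which reports a leading GTG as methionine on the assumption that it serves as the initiation codon, is a hypothetical convenience and not part of the disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def deduce_polypeptide(cds, initiator=False):
    """Translate an in-frame DNA coding sequence up to the first stop codon.

    If initiator is True and the first codon is GTG, it is reported as
    methionine (an assumption about how the start codon is read)."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":          # stop codon ends the polypeptide
            break
        if i == 0 and initiator and codon == "GTG":
            amino_acid = "M"
        residues.append(amino_acid)
    return "".join(residues)
```

For a stop-to-stop ORF that has not been trimmed to its start codon, the flag would be left at its default so that every codon is read literally.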

The polypeptides of the invention can be isolated from wild-type or mutant H.
pylori cells or from heterologous organisms or cells (including, but not limited to, bacteria,
yeast, insect, plant and mammalian cells) into which an H. pylori nucleic acid has been
introduced and expressed. In addition, the polypeptides can be part of recombinant fusion
proteins.

H. pylori polypeptides of the invention can be chemically synthesized using
commercially automated procedures such as those referenced herein.

Many of the polypeptides of the invention are related to one another. Some of these
relationships are described in Table 3 below. Most polypeptides described in Table 3 are
over 90% identical to one another as noted in the last two columns; some are between 70%
and 90% identical to one another, and very few share between 60% and 70% identity with
each other. The polypeptides represented by the sequence identification numbers in the
third column of Table 3 result from translations carried out from stop codon to stop codon
in the genomic nucleotide sequence of the invention, while those in the first column result
from translations carried out from the first methionine or valine codon following the prior
stop codon and proceeding to the final stop codon in the nucleotide sequence. In some
cases, the nucleotide sequence encoding the related polypeptides is slightly different,
resulting in some differences in amino acid residues of the related polypeptides. In many
cases, the related polypeptides differ significantly in length, with one polypeptide
containing amino acid residues in addition to those in common between the two
polypeptides. In all cases, the relationships described in Table 3 are highly significant, and
the nucleotide sequences encoding these related polypeptides are also very similar to one
another. For example, the nucleotide probes derived from the coding sequence of the
polypeptides in column one can be used in PCR or hybridization experiments to identify
clones carrying the nucleotide sequence encoding the polypeptides of column three.
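
The two translation conventions described above (stop codon to stop codon, versus first methionine or valine codon to stop codon) can be contrasted with a short Python sketch. It assumes the standard genetic code, a single forward reading frame, and an input containing only A, C, G and T; the function names and the example sequence are hypothetical and do not reproduce the procedure actually used to generate the Sequence Listing.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate reading frame 0 of `dna`; stop codons become '*'."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def orf_pairs(dna):
    """Return (stop_to_stop, start_to_stop) peptide pairs for frame 0.

    Only segments bounded by a stop codon on both sides are considered.
    The start-to-stop peptide begins at the first Met (M) or Val (V)
    residue of the segment; segments with neither are skipped.
    """
    pairs = []
    for segment in translate(dna).split("*")[1:-1]:
        starts = [i for i, aa in enumerate(segment) if aa in "MV"]
        if starts:
            pairs.append((segment, segment[starts[0]:]))
    return pairs


# Hypothetical sequence: stop, Pro, Lys, Met, Gly, stop.
print(orf_pairs("TAACCCAAAATGGGTTGA"))
```

In this toy example the stop-to-stop peptide carries two extra N-terminal residues relative to the start-to-stop peptide, which is the kind of difference the text attributes to the two translation conventions.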

The relationships between the polypeptides shown in Table 3 can be classified in
five broad categories as follows. First, for many polypeptides (designated "A" in the last
column of Table 3), the polypeptide denoted in column one is identical to the
polypeptide denoted in column three except for an occasional addition of a few putative
amino acid residues at the N-terminus, which result from the fact that the polypeptides of
column three were derived by translating from stop codon to stop codon instead of from a
predicted start codon (i.e., Met or Val) to a stop codon as was done for the polypeptides in
column one.

Second, for most polypeptides (designated "B" in the last column of Table 3),
the polypeptide of column one is at least 95% identical to the polypeptide of column three
except that the polypeptide in column three is longer (at either or both ends) by one or more
amino acid residues which do not result from the difference between reading from stop to
stop instead of from start to stop.

Third, for some polypeptides (designated "C" in the last column of Table 3), the
converse is true: the polypeptide of column one is at least 95% identical to the polypeptide
of column three except that the polypeptide in column three is shorter (at either or both
ends).

Fourth, for some polypeptides (designated "D" in the last column of Table 3), the
polypeptide of column one shares a high level of amino acid identity (i.e., at least 95%)
with the polypeptide of column three in the region in which they overlap, but shares little or
no identity (i.e., less than 95%) at one or both ends. The level of identity of the
polypeptides in columns one and three in categories "B", "C" and "D" is highly significant.
For example, a typical H. pylori gene product will exhibit amino acid sequence identities of
between 92% and 100% among strains of H. pylori isolated from human patients (see Table
10 below).

Finally, a fifth class of polypeptides in column one (designated "E" in the last
column of Table 3) are closely related but differ significantly (i.e., less than 95% identical)
from the polypeptide of column three. These polypeptides are likely "paralogs," members
of related gene families in H. pylori.
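
The "% Identity : Overlap Length" entries of Table 3 pair a percent identity with the number of residues over which it was measured. The following Python sketch shows one common way such a pair can be computed from two sequences that have already been aligned. It is an assumption that identity is counted over gap-free aligned columns; the alignment method used for Table 3 is not stated here, and the function name and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, overlap length) for two aligned sequences.

    Both inputs must have the same length. Columns in which either
    sequence has a gap are excluded from the overlap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    overlap = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if x != gap and y != gap]
    if not overlap:
        return 0.0, 0
    matches = sum(1 for x, y in overlap if x == y)
    return round(100.0 * matches / len(overlap), 1), len(overlap)


# Hypothetical aligned peptides: one mismatch in eight overlapping residues.
identity, length = percent_identity("MKTAYIAK", "MKTAHIAK")
print(f"{identity} : {length} aa")
```

Under the definitions given in the text, a pair scoring below 95% over its overlap, such as this toy example, would fall in category "E".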
SeqID#   Length (aa)   SeqID#   Length (aa)   % Identity : Overlap Length   Category
384      509           1779     593           100.0 : 504 aa                B
386      133           1713     205           99.2 : 127 aa                 B
387      158           1743     340           98.7 : 155 aa                 B
388      112           1489     330           100.0 : 105 aa                B
389      650           1643     369           95.4 : 65 aa                  D
390      62            1598     327           90.2 : 61 aa                  E
391      619           1747     991           99.3 : 608 aa                 B
392      110           1570     183           93.1 : 101 aa                 E
393      68            1811     85            97.1 : 68 aa                  B
394      446           1884     594           99.5 : 433 aa                 B
395      84            1503     347           100.0 : 84 aa                 B
396      40            1504     519           88.2 : 34 aa                  E
397      300           1531     273           98.9 : 267 aa                 C
398      214           1795     197           99.0 : 192 aa                 C
399      137           1639     199           96.4 : 137 aa                 B
400      272           1725     285           99.3 : 271 aa                 B
401      287           1470     455           98.9 : 277 aa                 D
402      106           1596     302           99.1 : 106 aa                 B
403      157           1654     197           99.3 : 150 aa                 B
404      278           1552     362           100.0 : 277 aa                B
405      120           1858     529           100.0 : 116 aa                B
406      226           1562     297           100.0 : 216 aa                B
407      62            1752     157           77.6 : 58 aa                  E
408      50            1663     74            97.9 : 47 aa                  B
410      188           1803     351           87.1 : 155 aa                 E
411      130           1685     497           100.0 : 129 aa                B
412      183           1843     185           100.0 : 183 aa                A
413      194           1874     508           97.6 : 169 aa                 D
414      235           1518     246           100.0 : 235 aa                B
415      88            1538     676           97.5 : 80 aa                  B
416      109           1871     335           99.1 : 109 aa                 B
417      107           1782     593           94.4 : 90 aa                  E
418      136           1544     441           100.0 : 135 aa                B
419      54            1513     175           97.0 : 33 aa                  E
420      125           1642     539           96.7 : 122 aa                 B
421      249           1738     346           99.6 : 249 aa                 B
422      86            1715     128           97.5 : 79 aa                  B
423      128           1534     149           100.0 : 128 aa                B
424      93            1875     178           100.0 : 93 aa                 B
425      88            1804     105           97.7 : 88 aa                  B
426      128           1524     116           62.4 : 117 aa                 E
427      108           1762     109           100.0 : 108 aa                A
428      118           1539     276           100.0 : 118 aa                B
429      54            1754     113           89.6 : 48 aa                  E
430      288           1483     377           95.5 : 286 aa                 B
431      303           1785     248           98.2 : 170 aa                 D
431      303           1784     148           96.4 : 138 aa                 D
432      192           1478     408           99.5 : 192 aa                 B
433      168           1517     426           99.3 : 151 aa                 D
434      153           1679     403           100.0 : 153 aa                B
435      162           1806     173           98.1 : 161 aa                 B
436      59            1834     80            37.5 : 24 aa                  E
437      222           1533     288           100.0 : 221 aa                B
438      53            1/3/     260           93.5 : 31 aa                  E
439      109           iOo5     121           100.0 : 109 aa                B
440      73            1773     280           100.0 : 73 aa                 B
441      23/           lOoD     497           96.0 : 198 aa                 D
442      92            1521     o4o           100.0 : 84 aa                 D
443      97            1774     327           100.0 : 96 aa                 B
444      280           1556     284           100.0 : 280 aa                A
445      187           1604     253           95.5 : 155 aa                 D
446      58            1510     198           96.4 : 55 aa                  B
447      85            1496     155           100.0 : 85 aa                 B
448      90            1724     90            100.0 : 90 aa                 A
449      105           1788     141           98.0 : 100 aa                 D
450      172           1495     239           100.0 : 172 aa                B
451      212           1456     139           81.7 : 126 aa                 E
452      182           1702     192           97.0 : 166 aa                 D
453      224           1658     228           100.0 : 224 aa                A
454      131           1895     159           100.0 : 131 aa                B
455      73            1787     370           100.0 : 54 aa                 D
456      92            1587     295           96.6 : 87 aa                  B
457      205           1735     310           100.0 : 195 aa                B
458      41            1479     144           97.6 : 41 aa                  B
459      219           1584     297           100.0 : 219 aa                B
460      188           1546     375           89.3 : 187 aa                 E
461      231           1780     438           97.3 : 225 aa                 B
462      63            1645     71            100.0 : 63 aa                 B
463      183           1664     242           97.8 : 182 aa                 B
464      117           1586     234           99.1 : 113 aa                 B
465      153           1773     280           98.0 : 153 aa                 B
466      43            1610     308           100.0 : 43 aa                 B
467      240           1636     255           97.0 : 237 aa                 B
468      209           1868     240           96.6 : 206 aa                 B
469      95            1628     321           98.5 : 65 aa                  D
470      229           1595     375           96.5 : 228 aa                 B
471      92            1713     205           90.1 : 71 aa                  E
472      96            1492     352           95.4 : 87 aa                  B
473      1178          1748     1183          100.0 : 1178 aa               A
474      163           1822     113           92.7 : 109 aa                 E
474      163           1820     88            93.1 : 58 aa                  E
475      400           1840     467           100.0 : 466 aa                A
476      00            150U     90            97.0 : 33 aa                  D
477      249           1854     254           100.0 : 248 aa                D
478      44            1772     528           100.0 : 44 aa                 D
479      421           1093     421           99.8 : 421 aa                 A
480      167           1555     340           98.1 : 162 aa                 B
481      89            1888     236           100.0 : 89 aa                 B
482      237           1873     461           97.5 : 236 aa                 B
483      4/1           1594     42o           99.3 : 301 aa                 C
484      123           1655     366           87.9 : 107 aa                 E
485      127           1655     366           99.2 : 127 aa                 B
486      259           1517     259           100.0 : 259 aa                A
487      17            1481     101           93.3 : 15 aa                  E
488      77            1704     246           100.0 : 77 aa                 B






489      180           1554     180           100.0 : 180 aa                A
490      213           1638     577           92.3 : 209 aa                 E
491      148           1452     157           100.0 : 148 aa                B
492      281           1519     849           99.6 : 281 aa                 B
493      144           1525     377           94.9 : 118 aa                 D
494      73            1539     276           95.8 : 71 aa                  B
495      311           1670     446           99.0 : 311 aa                 B
496      115           1842     384           99.1 : 114 aa                 B
497      200           1600     459           100.0 : 200 aa                B
498      264           1828     458           100.0 : 264 aa                B
499      339           1490     362           99.7 : 339 aa                 B
500      146           1449     430           94.4 : 142 aa                 E
501      60            1465     240           98.3 : 58 aa                  B
502      351           1458     532           98.6 : 346 aa                 B
503      233           1666     296           94.3 : 230 aa                 E
504      32            1513     175           100.0 : 32 aa                 B
505      171           1462     452           91.8 : 171 aa                 B
506      33            1672     121           100.0 : 24 aa                 D
507      100           1650     192           80.0 : 90 aa                  E
508      79            1772     528           80.8 : 78 aa                  E
509      70            1662     88            96.2 : 53 aa                  D
510      130           1530     131           100.0 : 130 aa                A
511      53            1723     163           91.8 : 49 aa                  E
512      79            1572     189           98.5 : 65 aa                  D
513      102           1889     114           99.0 : 100 aa                 B
514      15            1516     407           100.0 : 12 aa                 B
515      378           1747     991           97.9 : 377 aa                 B
516      1027          1692     2440          100.0 : 1027 aa               B
517      155           1831     182           89.6 : 154 aa                 E
518      62            1552     362           74.5 : 47 aa                  E
519      237           1557     241           100.0 : 237 aa                A
520      83            1627     424           100.0 : 83 aa                 B
522      64            1680     227           98.4 : 63 aa                  B
523      228           1532     241           100.0 : 228 aa                B
524      273           1710     189           97.2 : 180 aa                 C
524      273           1712     115           100.0 : 99 aa                 C
525      15            1612     65            100.0 : 15 aa                 B
526      115           1733     112           94.6 : 111 aa                 C
527      67            1755     156           100.0 : 62 aa                 D
528      323           1635     336           100.0 : 322 aa                B
529      10            1731     101           50.0 : 10 aa                  B
530      12            1455     284           100.0 : 12 aa                 B
531      79            1528     101           94.9 : 79 aa                  B
532      95            1887     279           100.0 : 95 aa                 B
533      154           1591     168           100.0 : 152 aa                B
534      68            1501     283           100.0 : 67 aa                 B
535      313           1519     849           95.5 : 313 aa                 B
536      319           1486     325           100.0 : 319 aa                A
537      118           1707     154           90.9 : 110 aa                 E
538      89            1674     161           88.2 : 76 aa                  E
539      92            1838     100           100.0 : 92 aa                 A
540      138           1525     377           86.9 : 137 aa                 B
541      277           1661     394           96.0 : 273 aa                 D
542      254           1567     392           98.0 : 254 aa                 B
543      185           1467     594           99.5 : 185 aa                 B
544      37            1886     326           100.0 : 37 aa                 B






545      94            1772     528           89.1 : 92 aa                  E
546      182           1760     366           100.0 : 182 aa                B
547      247           1776     150           100.0 : 146 aa                C
547      247           1775     88            100.0 : 87 aa                 C
548      422           1798     426           100.0 : 422 aa                A
549      61            1620     78            96.7 : 60 aa                  B
550      54            1566     309           37.8 : 45 aa                  E
551      298           1751     518           98.0 : 297 aa                 B
552      91            1738     346           95.5 : 88 aa                  D
553      111           1764     130           100.0 : 111 aa                B
554      137           1626     93            100.0 : 84 aa                 D
555      124           1717     114           89.2 : 111 aa                 E
556      86            1597     432           89.7 : 78 aa                  E
557      87            1457     97            100.0 : 86 aa                 B
558      108           1750     154           99.1 : 108 aa                 B
559      142           1619     211           97.9 : 141 aa                 B
560      231           1580     315           99.6 : 231 aa                 B
561      186           1446     275           88.3 : 180 aa                 E
562      183           1515     198           100.0 : 183 aa                B
563      154           1477     188           100.0 : 154 aa                B
564      288           1767     303           100.0 : 288 aa                B
565      420           1841     424           100.0 : 420 aa                A
566      72            1450     486           98.5 : 67 aa                  D
567      205           1799     270           99.5 : 199 aa                 D
568      328           1893     338           100.0 : 328 aa                B
569      140           1855     614           84.8 : 125 aa                 E
570      76            1499     184           98.1 : 53 aa                  D
571      194           1660     201           100.0 : 193 aa                B
572      140           1583     163           96.9 : 130 aa                 D
573      308           1676     316           100.0 : 308 aa                B
574      339           1667     468           99.1 : 335 aa                 B
575      207           1688     208           98.1 : 207 aa                 A
576      251           1891     267           100.0 : 251 aa                B
577      69            1745     109           96.7 : 61 aa                  D
578      112           1519     849           99.1 : 112 aa                 B
579      152           1649     543           98.7 : 152 aa                 B
580      130           1853     220           98.3 : 120 aa                 D
581      113           1614     134           100.0 : 113 aa                B
582      174           1569     209           100.0 : 174 aa                B
583      35            1752     157           34.4 : 32 aa                  E
584      308           1728     316           100.0 : 307 aa                B
585      702           1857     797           99.9 : 702 aa                 B
586      293           1633     664           96.2 : 293 aa                 B
587      52            1542     429           100.0 : 52 aa                 B
588      182           1504     519           100.0 : 182 aa                B
589      251           1622     262           99.6 : 251 aa                 B
590      40            1632     61            97.5 : 40 aa                  B
591      122           1691     717           99.1 : 113 aa                 D
592      113           1768     121           100.0 : 113 aa                B
593      437           1520     448           100.0 : 437 aa                A
594      140           1549     2UU           93.2 : 146 aa                 E
595      128           1494     131           99.2 : 127 aa                 B
596      95            1667     468           100.0 : 93 aa                 B
597      102           1447     149           100.0 : 91 aa                 D
598      127           1879     138           100.0 : 127 aa                B
599      502           1881     928           99.2 : 500 aa                 B






600 


42 


1625 


91 


100 0 • 42 an 


R 


601 


341 


1454 


381 


99 7 • 341 aa 




602 


465 


1694 


442 


98 2 * 433 aa . 


r> 


602 


465 


1896 


259 


99 6 * 238 aa 




603 


193 


1484 


229 


99 4 • 1 70 aa 


r\ 


604 


141 


1673 


130 


97 6 • 196 «a 


n 


605 


159 


1582 


358 


83 3 • 156 aa 


i? 
t» 


606 


376 


1565 


382 


100 0 • 376 aa 


T> 


607 


352 


1862 


363 


99 1 * 359 an 


T> 
£> 


608 


88 


1807 


144 


98 7 • 77 aa 


U 


609 


86 


1553 


181 


72 0 • 89 aa 


ri 


610 


76 


1527 


77 


100 0 • 76 aa 


A 


611 


98 


1732 


118 

A A U 


08 0 • 0^ ao 


13 


612 


162 


1525 


377 


01 1 * 1A1 QO 

• 101 aa 


£ 


613 


58 


1547 


152 


QQ *2 • CO — 0 

foj . do aa 


B 


614 


66 


1470 


455 




£ 


615 


117 

A A # 


1844 


98? 


7j.d . 1 1 1 aa 


D 


616 


83 


1723 


163 


70.j , oz aa 


B 


617 


77 


1763 


97S 


ov.o : / / aa 


E 


618 


176 

X i XI 


1U7U 


140 


Ql < • IOC —a 

yj.o : 125 aa 


E 


618 


176 


1608 

1070 


9^1 


OO 1 • « Alb 

V2.J : 65 aa 


E 


619 


144 


14Q8 

l*t70 


577 


70.U : 124 aa 


D 


620 


90 


1516 


ACM 


no n . on — _ 

7o.v : yu aa 


B 


621 


268 


1571 


97V 


luu.u : 268 aa 


B 


622 


171 

X / X 


1753 


^uu 


: 171 aa 


B 


623 


102 


1550 




xuu.u : 84 aa 


D 


624 


117 

XX/ 


1819 

1 O 1 7 


fixlft 
U*tU 


74.2 : 1U4 aa 


E 


625 


237 


1758 

1 /JO 


R49 


77.o : 2J3 aa 


B 


626 


199 

X 77 


1464 
i*tu*t 




luu.u : lye aa 


C 


627 


225 


161 1 

XUl 1 


147 


07 *7 • Oil 

7/./ : 221 aa 


B 


627 


225 


1608 


07 

7/ 


07 9 • Q1 #»» 

y/.o : yi aa 


C 


628 


86 


1772 

1 / 1 4m 


598 


7U.4 : 03 aa 


E 


629 


496 


1558 


530 


luu.u . wo aa 


B 


630 


142 

A • 


1778 


339 


QQ 1 • 1 1< o« 

77.1 . i id aa 


U 


631 


153 • 


1709 


111 
111 


76 Q • 1HQ Q n 


tr 

c- 


631 


153 


1708 


76 


fl\ Q • AjtL oq 




633 


93 


1848 


106 


1 00 0 • a or 


D 


634 


177 


1603 


531 


98 0 * 175 aa 
70*7 • 1 / J cut 


13 


635 


88 


1812 


93 


1 00 0 * 88 aa 


A 


636 


115 


1746 


122 


100 0 • 1 15 aa 


D 


637 


261 


1850 


304 


98 5 * 260 aa 


P 


638 


191 


1691 


717 


99 5 • 185 aa 


O 


639 


351 


1697 


373 


99 7 • 351 aa 


D 


640 


351 


1601 


355 


100 0 • 342 aa 


n 


641 


146 


1592 


320 


96 6 * 145 aa 




642 


190 


1796 


376 


100 0 • 190 aa 


o 


643 


489 


1705 


957 


99 4 * 468 aa 

//♦t • *tUO Oct 


n 


644 


249 


1706 


?49 

mat S 


100 0*948aa 

1UU.W • Z*tO aa 


A 
A 


645 


120 


1540 


269 


100 0 • IHiaa 


U 


646      109           1550     350           100.0 : 108 aa                B
648      158           1573     237           95.5 : 155 aa                 D
649      312           1721     312           100.0 : 312 aa                A
650      56            1839     242           98.1 : 52 aa                  D
651      199           1813     261           99.5 : 196 aa                 B
652      82            1756     99            96.3 : 81 aa                  D
653      89            1475     227           100.0 : 89 aa                 B






A^A 


111 
111 


1 «^ 

IDDD 


110 

1 iy 


Qfi 9 * 1 1 1 aa 

70Z • 1 1 1 OA 






A< 
03 


1 AfiA 
1050 


07A 

z/o 






ASA 


OAft 
Z05 


1ftQ9 
I07Z 


070 
Z/Z 


1 00 n * O f\R aa 


A 
A 


AS7 


1 nn 

1UU 


1 SQ7 


•AT> 
4>Z 


1 nn n * ra aa 


n 
u 


A5R 


fin 

5U 


1 AAO 


07A 
Z/O 


Q7 O . 70 M 

y 1 «Z * aa 


u 


ASQ 


On^ 
ZU3 


IfiOA 

IOZ*T 


017 
Zl / 


1 nn n * 9 ns aa 

1UU«U • avJ cut 


r» 


AAn 


10A 

iz*t 


1A71 


A7 
O/ 


QR A * AA aa 


1^ 


AAn 


ioa 


1A79 
i*r /Z 


IfiA 
loo 


OA 7 • An aa 

7D. / • UU cUl 




OOl 


yj 


1 AIR 


yj 


QR Q • 0^ aa 
70.7 • 7 J aa 


15 
D 


AA9 

ooz 


sin 

5U 


I^Afi 


i7n 
1 /u 


OR 7 • 7Q aa 
7o* / • Iy aa 


D 


AA** 
00 J 


Z14 


17AO 
1 /47 


00A 
ZZH 


1 nn n • 91 a 

1 UU.U • Zl*r aa 


XI 
D 


AAA 
004 


7fi 
/5 


1 AAfi 
1000 


01 A 
Z14. 


07 7 - 7^ oa 
7 /.j • /J aa 


T\ 
U 


AAS 
OOJ 


177 
1 / / 


1 QAA 
1544 


OfiO 

yoz 


7R A • 1 7^ oa 
/5.0 • 1 / J aa 




AAA 
000 


O^fi 
ZD 5 


1 ^7fi 
13/5 


07^ 
Z/3 


OA 1 • O^A 00 
y4.1 » ZJ4 aa 




AA7 
00/ 


O^ 


1 OAO 
1 /4Z 


iAn 
30U 


1 nn n • 7^ oo 

1UU.U . / J Ha 


. 15 


AAQ 
00 o 


AOI 
4Z1 


1 AQQ 

loyy 


AOI 
4Z1 


OQ < • AOI 00 
yy.J I 4Z1 aa 


A 

A 


AAQ 

ooy 


fil 
51 


1 ^AO 
1 34Z 


AOO 

4zy 


1 nn n • c 1 00 

1UU.U • 51 aa 


xx 

D 


A7n 
O/U 


IIO 
1 1Z 


1 KAf\ 
134U 


0A0 

zoy 


7A 1 • in go 

/O.l ♦ 1 1 J oa 


X2 


A71 
0/ 1 


' IOA 
JZO 


1 AQO 
105V 


OAA 
Z40 


OO fi • Onfi aa 

yZ.5 • ZU5 aa 


■X2 


AOO 
0/Z 


Z/Z 


1O0O 
1 /Z/ 


OOO 

z/y 


1 nn n • 001 ao 
100.0 . z/i 


A 


ATI 
0/J 


Ofi 
/o 


1 AA1 
1001 


3y4 


1 nn n • oq «o 
loo.o . /5 aa 


0 


AOA 
0/4 


>t aa 
400 


101 A 
l/3o 


A1 0 

4iy 


1 nn a • Ann 

100.0 : 400 aa 


T% 

D 


675 


91 


1 CIO 

1522 


273 


95.5 : 88 aa 


D 


0/0 


126 


1 CC1 

1561 


131 


1 AA A • Mn 

100.0 : 126 aa 


A 


Oil 


1 AC 

195 


1577 


301 


HC C.I C\C MM 

76.5 : 196 aa 


b 


CIO 

678 


72 


101 >i 
1814 


90 


AO C • TO nM 

98.6 : 72 aa 


J3 


£OQ 

o/y 


111 


1572 


ioy 


1 AA A • 1 1 1 

loo.o : 111 aa 


TJ 

D 


£QA 

ooU 


1 VlO 

14Z 


1 OvlO 

184/ 


ono 

zo/ 


OO C • 11C aa 

yz.o 1 133 aa 




OOl 


Oi 


1 <i c 
1516 


AAO 

40/ 


OA yf • <A aa 

yo.4 : do aa 


u 


<oo 
OoZ 


1 1 A 

310 


1 oaa 
1700 


A AC 

440 


OA a . 0.A1 aa 

yy.3 : 301 aa 


Jt5 


683 


1 £A 

169 


1*7/1 A 

1744 


219 


1 AA A.I CCi aa 

100.0 : 169 aa 


15 


684 


or 

85 


1611 


1AH 

347 


OO 1 • OA mm 

88.1 : 84 aa 


b 


685 


77 


1829 


Z51 


AT 1 • *7C mm 

97.3 : 75 aa 


U 


OOO 


1 A"7 

197 


1 CIA 

1634 


/IOC 

4ZD 


OA A.I QA aa 

yy .0 : 1 y4 aa 


U 


AOO 
05/ 


1 ao 
14Z 


1 0OO 

1 /ZZ 


1 00 

iy / 


1 AA n • 1 3 A ao. 
1UU.U • IjO aa 


n 
u 


Afifi 
055 


fiA 

50 


1 fin< 

15UD 


57 
5/ 


1 nn n • fiA aa 

1UU.U • 50 aa 


A 

rx 


AfiQ 


101 
1Z1 


17<Q 

1 IDy 


ODO 


07 • 1 1 fi on 
y / .J • 115 aa 




AOH 

oyu 


J7 


17fiO ■ 
1 ioy 


77 
/ / 


RA A • S9 00 
OH.O • -JZ aa 


F 


AQ1 


1 AO 
14Z 


1 A7<k 
10/D 


Aon 

4ZU 


07 1*1 An oa 
y /.I • l*rU oa 


R 


AGO 
OyZ 


sn 
50 


1700 
1 /ZZ 


107 

iy / 


. oa 0 . An oa 
OJ»J • OU aa 


Ij 


AOI 

oy> 


1 sn 

15U 


1 A^O 
103V 


1R7 
15/ 


1 nn n • 1 70 aa 

1UU.U . 1 /7 oa 


R 


AOA 
0y4 


7^ * 

/D 


1 ^OO 


1 £.0 
1 Dy 


1 nn n * 7^ oa 

I UU.U . / J oa 


r> 


AQS 
Oyj 


QQ 


1 fiAQ 


110 

1 iy 


1 nn n * qo aa 

1UU.U • 77 oa 




AQA 
O7O 


ono 

ZV7 


17m 

1 /Ul 


A^n 


QQ ^ * 9n^ aa 
77.J * ZUJ oa 




A07 


7^ 


1 A1 A 
IOIO 


1QA 


1 nn n • 7S aa 

1UU.U • / J oa 


R 


AQR 
070 


101 

1Z1 


1 A7fi 


AAn 


inn n * 191 aa 


R 


AQQ 
oyy 


mo 

1UZ 


1S7Q 
iD iy 


1^0 


QQ n • 1 nn aa 
77. v • lw oa 


R 


7nn 
/uu 


111 
13 1 


1 ^OS 
IDyj 


17^ 


QQ 0 • 1^1 aa 

yy.z • i3i oa 




Oni 
/01 


AA1 
441 


1 Afifi 
1455 


AAA 
444 


QA Q • A9A aa 
yO.y • *rZO aa 




ono 
/UZ 


lOO 
1Z/ 


1 OOA 
1 / /4 


aoo 

3Z/ 


QO 1 • 1 1 A ao 

yy.i . 1 10 aa 


jj 


70^ 


9R7 


1 /UJ 


^77 


95 9 • 970 aa 

7J.7 . / 7 oa 


F 


704      46            1581     111           95.2 : 21 aa                  D
705      115           1647     117           100.0 : 115 aa                A
706      86            1463     111           100.0 : 86 aa                 B
707      44            1588     68            100.0 : 44 aa                 B
708      560           1589     1413          98.8 : 560 aa                 B






709 


125 




441 


7J.o • X Xo aE 


D 


710 


96 


1469 


540 


inn n • 7i qo 

XUU.U • /J oa 


D 


711 


81 


1629 


944 


70.J • oU 22 


D 


712 


141 


1671 

Ivr 1 


375 


inn n • 195 oa 


• r\ 
V 


713 




1867 
1 oo/ 


101 

X7J 


yOA • di aa 


o 


714 


233 


1833 


940 
Z*fU 


OO < • OI 0 on 

77.5 . zio aa 


T*V 

D 


715 


266 


1644 


971 
Z/ X 


inn n • oo 
iuu.u * zo5 aa 


B 


716 


158 


1883 


996 


OA 7*1 <n oo 

V4./ . i ju aa 


T? 

E 


717 


1Q1 

171 


1860 


94fi 


iuu.u : io7 aa 


B 


718 

/ 1 o 


943 


1817 
1 ox / 


9^6 
Z30 


77.o : Z43 aa 


B 


710 

#17 


936 


1617 

XOX / 


£A7 
OO / 


9o.i . Z3o aa 


B 


720 


175 


1593 


509 


07 1 • 1 TO on 

7/.1 . i /z aa • 


B 


721 


226 


1770 

1 / /U 


140 


OO 7 • 0 1 O oo 

7Z./ • zio aa 


E 


777 


7Q 
ty 


1699 


944 


inn n • *ia oo 

iuu.u : 74 aa 


D 


791 


65 


150A 
iOUO 


07O 
ZZV 


1 AA A • VIA _~ 

iuu.u : 49 aa 


D 


17A 


lift 


1£7* 


4Z0 


A1 O ■ 1 — — 

93.2 : 132 aa 


E 


/ZD 


oi n 
Z1U 


1 MO 

153Z 


Hie 

215 


100.0 : 210 aa 


A 


/Zu 


0O£ 

ZVO 


loZl 


223 


100.0 : 190aa 


D 


/Zo 


29o 


1819 


640 


99.1 : 106 aa 


D 


727 


a>* 


1706 


*\ Ai\ 

249 


97.8 : 91 aa 


D 


728 


83 


1593 


I7l 


100.0 : 83 aa 


B 


/29 


203 


1 A AO 

1448 


268 


100.0 : 202 aa 


B 


730 


220 


1656 


242 


100.0 : 220 aa 


B 


/JI 


116 


1 01 O 

1818 


196 


100.0 : 115aa 


B 


152. 


248 


1 A 0 ! A 

1474 


248 


100.0 : 248 aa 


A 


nil 
155 


1 <A 

150 


i /ten 
1459 


347 


4 O.O. a * « n 

100.0 : 118aa 


D 


I5*¥ 


Z2o 


1545 


302 


96.9 : 227 aa 


B 




18/ 


1597 


432 


100.0: 172 aa 


D 


/5o 


i no 
19o 


1 Ha~\ 

• l/ol 


354 


100.0 : 198 aa 


B 


yj / 


ZU8 


1780 


438 


/^o o o\on 

93.8 :208aa 


E 


71Q 

/Jo 


ZU1 


1 QA1 

184Z 


384 


100.0 : 193 aa 


D 


I5y 


lit 


1 4C£A 

10u9 


405 


21.6 : 74 aa 


E 


74A 


fy 


1615 


AHA 

280 


97.3 : 75 aa 


D 


741 
/41 


4G 

**y 


lo/o 


/I vl O 

443 


91.3: 46 aa 


E 


747 
/*fZ 


41Z 


lo/ / 


425 


97.8 : 412 aa 


B 


/HJ 


ftO 


i*roZ 


ill 


1 AA A . OA 

100.0 : 89 aa 


B 


744 


1 oo 
I7Z 


13UZ 


ZZZ 


AO A • 1 OA _ ~ 

98.4 : 189 aa 


B 


745 


xuy 


1707 
I /7/ 


1114 

JU4 


A^ 1 • 1 AA 

70*3 : 109 aa 


B 


/*tO 


1 1 a 
1 io 


1701 
X /5M 


5Dy 


c\a a . 1 1 a 

94.8 : 116 aa 


B 


74.7 
/*t / 


i£i 


171 1 
1 / X X 


191 
JZX 


inn A.i 
iuu.u : loi aa 


B 


748 
/*to 


40 


1481 


177 
<3 / / 


i nn n » /in oo 
i uu.u * 4U aa 


O 


749 


45 


1770 
X / /u 


140. 


Q< 7 * 41 oo 

VD.z . 4Z aa 


U 


750 


1 18 

X 1 o 


1867 


101 
xyj 


OO O • 11/: aa 

yz,z . i io aa 


Jb 


751 


200 


1491 


91 1 

Z.X X 


inn n • onn «o 
xuu.u . zuu aa 


r> 


752 


79 


1890 

X 07U 


80 


i nn n • 7Q oo 
xuu.u . iy aa 


A 


753 


98 


1837 

X O J 1 


968 
zoo 


Ofi 7 * 70 oo 

7o. / . iy aa 


u 


754 


146 


1653 


413 
*tx J 


inn n • 141 oo 
xuu.u . i*u aa 


£) 


755 


706 


1638 


577 
Off 


Ol 7 . OA£ oo 

7J./ : zuo aa 


B 


756 


Q7 
y / 


1 596 

X JZO 


$0 
07 


i nn a • aa 

iuu.u : oo aa 


D 


757 


197 


1794 


198 


1 00 0 * 1 07 
XUU.U • 17/ oa 


A 


758      149           1872     94            97.5 : 80 aa                  D
759      68            1882     291           100.0 : 68 aa                 B
760      135           1631     162           99.3 : 134 aa                 B
761      93            1884     594           100.0 : 69 aa                 D
762      73            1883     226           100.0 : 73 aa                 B
763      268           1859     430           95.4 : 263 aa                 B






7A4 


OTA 

2/4 


1 ^1T 
133/ 


TTT 

277 


1AA A • 071 ««. 

100.0 . 2/3 aa 


B 


/UJ 


1 14 
1 34 


1714 
1 / 14 


1 

130 


QQ T • 1 71 am 

yy.2 . 133 aa 


A 




TT 
/Z 


i sin 
ioiu 


TA 
/4 


0< 0 • T1 an 

y3.o . / 1 aa 


A 


i\jf 


147 


18A7 


TO* 
253 


1AA A • 14T ao 

iuu.u . in/ aa 


B. 


768 


1/14 


1 S41 


TA1 
241 


1AA A • 1AA aa 

loo.o . 144 aa 


B 




47 


1 UCrr 


7* 
/3 


1 AA A • 47 an 

i uo*u . 4 / aa 


B 


770 


170 

1^7 


1 sn^ 


1 81 
153 


OA 7 • 1 77 aa 

?u*z . 122 aa 


b 


771 


ZtZ 




233 


OQ 1 • TIT aa 

yy. l . 232 aa 


C 


111 


07 

y i 


18S1 
1031 


1 TO 

i2y 


OO A • OT act 

yy.o • y / aa 


B 


1 Ij 


141 
143 


1520 


23y 


96.5 : 143 aa 


B 


1 /4 


71 0 
Z15 


illl 


353 


OA C • T1 T — 

99.5 :217aa 


B 


1 /3 


OA 
54 


1 itlT 

1037 


143 


1 A/\ /\ . O A 

100.0 : 84 aa 


B 


1 10 


T£ 
/0 


1 //4 


ITT 

327 


AC A . "™ 

95.9 : 73 aa 


D 


III 


133 


i <o< 
1353 


263 


79.7 : 133 aa 


E 


1 lo 


iyz 


1 A OA 

1480 


TAA 

299 


91.5 : 189 aa 


E 


1 ly 


1 T1 

1/1 


i <<i 
1331 


186 


A A A . 

99.4 : 171 aa 


B 


70A 
/oU 


1 A1 
141 


1 /OU 


366 


AA O . 1 OA — 

99.2 : 129 aa 


D 


T01 
/51 


TA 

l\) 


1 TA 1 
1 /41 


95 


1 AA A * *7/"\ _ 

100.0 : 70 aa 


B 


TOT 

\ /5Z 


1 33 


105/ 


223 


AO A . 1 CO " ~ 

98.0 : 153 aa 


B 


TOl 
/53 


1 Ol 

153 


1 O£0 

1509 


1 OA 

184 


t A/\ A . « o a 

100.0 : 183 aa 


A 


TOvl 

/54 


Ol 


1 O/HC 

1540 


231 


100.0 : 67 aa 


B 


TOS. 

/53 


T<.A 

234 


1 /30 


256 


100.0 : 253 aa 


B 


TO£ 
/OO 


1 Tl 

173 


1 A££ 

1466 


251 


100.0 : 166 aa 


D 


TOT 

/5/ 


T<0 

zsy 


1 OAO 

1808 


aaa 

322 


4 /\/\ a . aan 

100.0 : 238 aa 


D 


TOO 

758 


294 


1786 


131 


na t ^ 0% ^ 

93.7 : 126 aa 


E 


TOO 

loo 


TO/1 

294 


1 /CO A 

1630 


i ^o 
163 


94.0 : 116aa 


E 


TOA 

789 


i a a 

194 


1655 


366 


99.2 : 122 aa 


D 


TOO 

789 


1 A/t 

194 


1646 


75 


« aa a 

100.0 : 72 aa 


C 


TOA 


132 


1790 


273 


100.0: 132 aa 


B 


TQ1 

/yi 


213 


1 CTC 

1575 


n/ 
216 


/\n 4 a * a 

98.1 : 213 aa 


A 


TOT 

/yz 


An 
4/ 


1 £AC 

1005 


86 


1 /\a a jt *y 

100.0 : 47 aa 


B 


TOl 

lyj 


143 


1 OiC/f 

1804 


0 >• A 

342 


Aa s\ i ^ a 

93.0 : 143 aa 


E 


TOA 

/y4 


oy 


i nan 
1760 


AO 

93 


1 /\/\ a . 

100.0 : 69 aa 


B 


TO< 

/yo 


144 


1 CA/1 

1304 


CIO 

519 


nc 1 a n 

95.7 : 138aa 


D 


TO£ 

/yo 


1 OA 

190 


1 CftA 

1590 


193 


i /\ a a 4 Aa 

100.0 : 190 aa 


A 


TOT 

/y/ 


OA 

54 


1 TOO 

1 /92 


TIC 

215 


83.5 : 79 aa 


E 


TOO 

/y© 


13D 


1 £L\ 1 
1013 


138 


100.0 : 135 aa 


A 


/yy 


T<< 

233 


1 QQC 

1553 


/IT/1 

424 


AA £. . ACO — _ 

99.6 : 253 aa 


B 


qaa 
500 


TA1 

243 


1 AOT 

i4y3 


one 

205 


OT 1 - 1 A A — . — 

87.1 : 140 aa 


E 


saa 

oUU 


TA1 

243 


1 AQT 

i4y / 


in 
131 


1 AA A . 11/ 

100.0 : 126 aa 


C 


©Ul 


7^7 
232 


i ^<i 

1303 


TC< 

233 


1 AA A * TCT — ~>. 

100.0 : 252 aa 


A 


RA7 
ou£ 


1 nf. 


1770 
1 /ZU 


01 

yi 


1 AA A • OO 

loo.o : 88 aa 




80? 

Ova 




171R 
1 / I o 


00 

yy 


OT T • OT aa 

yz. / : 52 aa 


h 


OUJ 


171 


1 AA7 


707 

zy / 


Of C.I CA Mn 

53.3 : 139 aa 


X7 
t. 


out 




14JK 


1 fS, 
103 


1 AA A • <1 aa 

iuo.0 : oi aa 


T\ 

D 


0\JJ 


1QQ 

iw 


1 ^14 
1314 


T1 Q 

ziy 


1 AA A . 1 AA 

100.0 : 199 aa 


B 


8AA 

OUU 


131 


1 QQA 
1550 


1Q1 

391 


OZ A * i a A 

86.2: 130 aa 


E 


OAT 
OU/ 


1 A1 

lul 


1 T1 O 

i /iy 


TCT 

257 


AC A 1 AA 

95.0 : 100 aa 


B 


QAO 
505 


OA 

50 


1 CQC 

1353 


263 


A a a a a 

98.8 : 80 aa 


B 


OAQ 

ouy 


a ai 
44/ 


1 TO 1 

1/51 


A A 

344 


99.7 : 331 aa 


D 


809 


447 


1781 


118 

1 1 o 


DO 1 . 1 1 *7 _ a 

7o.j . 1 1 / aa 


r» 
U 


810 


218 


1564 


250 


100.0: 218 aa 


B 


811 


192 


1478 


408 


99.5 : 192 aa 


B 


812 


341 


1729 


354 


100.0 : 340 aa 


B 


813 


201 


1523 


592 


100.0: 158 aa 


D 


814 


130 


1769 


306 


92.3 : 130 aa 


E 



WO 96/40893 PCT/US96/09122 






81 K 
513 


1 AA 

144 


1 875 
1533 


548 


78.4 : 116 aa 


E 


81 5 


1 A A 

144 


181 5 


777 
Oil 


84.3 : 121 aa 


E 


816 


7AQ 

JV7 


1RQ4 

107*T 


71 A 
31U 


1 A A A • O AA 

100.0 : 309 aa


A 


817 


1£4 

1 D4 


1897 

A OZ J 


1 <A 
130 


96.7 : 153 aa 


D 


818 

Ol o 


5A 
34 


1707 


57 
5/ 


1 AA A m d — — 

100.0: 53 aa 


B 


81Q 


909 
ZUZ 


1716 

1 / 1U 


91 <E 
zlD 


OA C • OA1 

yy.j : 202 aa 


B 


890 
ozu 


1 AO 


10 /U 


lie 
1 13 


1 A A A « l 1 A , 

100.0 : 110 aa 


D 


891 

OZ1 


• 1 8A 
150 


1A51 
143 1 


7C4 

Z34 


1 A A A - 10/ . 

100.0 : 186aa 


B 


ozz 


/Z 


1A0Q 

10U7 


1 OA 

loU 


f\C / , f ft . 

96.6 : 58 aa 


D 


897 . 
5Z3 


190 
Izz 


1 A77 
1033 


hha 
004 ^ 


f\A A . lift " 

94.9 : 118 aa


D 


B9A 


13 


1 AO^ 
109 D 


*7A 

70 


100.0 :13 aa 


B 


894 
5Z4 


- 1 7 


1 *£7 
130/ 


392 


100.0 : 13 aa 


B 


97 < 
5z3 


GO 

99 


i 771 
1 / 11 


1 >1 A 

144 


100.0: 86 aa 


D 


Q7£ 
5z6 


71 


1512 


1 C A 

154 


98.6 : 71 aa 


B 


©77 

5z/ 


1 <A 


1430 


139 


93.7 : 127 aa 


E 


coo 

OZO 


255 


1651 


155 


100.0: 155 aa 


C 


OZO 


net 

255 


1 £.A O 

1648 


109 


98.0: 102 aa 


D 


829 


1 A^ 

106 


1726 


% A*1 

147 


99.1 : 106 aa 


B 


OO A 

830 


527 


1509 


567 


99.8 : 524 aa 


B 


831 


124 


1 n A f\ 

1740 


214 


99.2 : 123 aa 


B 


832 


110 


1690 


400 


98.2 : 110 aa


B 


833 


326 


1856 


325 


100.0: 325 aa 


C 


Ci A 

834 


253 


1543 


191 


96.7: 184aa 


D 


835 


597 


1876 


638 


100.0: 594 aa 


B 


836 


156 


t ^n 1 
1621 


345 


99.4: 154 aa 


B 


837 


157 


1624 


253 


97.4 : 155 aa 


B 


On O - 

838 


191 


1465 


241 


96.8 : 190 aa 


B 


839 


90 


1548 


231 


92.9 : 85 aa 


E 


OA A 

840 


277 


1682 


298 


99.6 : 277 aa 


B 


OA 1 

841 


58 


1508 


270 


98.1 : 54 aa 


D 


842 


139 


1734 


139 


88.4 : 129 aa 


E 


843 


nnA 

279 


1476 


276 


4 Af\. *f\ A^. ^ A**. 

100.0 : 269 aa 


C 


OA A 

844 


36 


1852 


83 


94.4 : 36 aa 


B 


QA< 

843 


1 OA 

129 


1 A £.t\ 

1460 


196 


100.0 : 129 aa


B 


546 


127 


1473 


172 


100.0 : 105 aa 


D 


<M7 

54/ 


On/^ 

276 


1657 


n * A 

319 


100.0 : 276 aa 


B 


545 


1 o^ 
J26 


1 OA1 

1801 


187 


98.4 : 126 aa 


B 


549 


O/IO 

z4z 


1669 


A AC 

405 


97.0 : 236 aa 


D 


850 

53U 


101 


i ©i x 
1510 


Oil 

217 


1 AA f\ . 1 1 _ 

100.0 : 151 aa


B 


851 
531 


1 <A 

134 


1 AA1 

1041 


194 


1 AA i\ - 1 ^ A 

100.0 : 154 aa 


B 


859 
03z 


AO£ 

490 


1003 


327 


An r . 11 >i 

97.5 : 314 aa 


C 


857 


i /y 


1 AOA 
10U0 


777 
111 


An 0.1 *?n 

97.2 : 179 aa 


B 


855 


i ao 

10U 


1 A87 
145 / 


195 


r\/r A.I dZf\ — ~ 

96.9 : 160aa 


B 


OJO 


74 


1 87 A 
1530 


A7A 

4zU 


1 A A A • AT 

iuu.0 : 93 aa 


B 


857 


1 

140 


1543 


771 
J/1 


99.3 : 146 aa 


B 


858 


905 
. ZU3 


1 507 
13U / 


300 


98.3 : 202 aa 


B 


850 


177 
1 / J 


1 897 
10Z / 


1 OQ 

195 


1 AA A • 1 A A * M • 

10U.0 : 149 aa 


D 


8^n 
oou 


107 

iyj 


1 fSL\ 
105 1 


7 OA 

z94 


1 AA A . 1 *TC 

1UU.U : 175 aa 


D 


8A1 
501 


1U4 


. 1503 


oca 
253 


1 A A A - 1 f\ A 

100.0 : 104 aa 


B 


862 


355 

J *J »J 


1599 


758 


100 0 • 755 00 
luu.u . 333 aa 


15 


863 


93 


1825 


236 


97.8 : 92 aa 


B 


864 


82 


1677 


315 


97.5 : 81 aa 


B 


865 


234 


1861 


249 


99.1 : 233 aa


B 


866 


239 


1652 


405 


99.2 : 238 aa 


B 


867 


86 


1468 


201 


77.4: 84 aa 


E 


868 


289 


1574 


395 


94.4: 284 aa 


E 






869 


436 


1863 


662 


100.0 : 436 aa 


B 


870 


114 


1560 


121 


100.0: 114 aa 


B 


871 


173 


1757 


362 


98.8 : 173 aa 


B 


872 


116 


1866 


145 


98.1 : 105 aa 


D 


873 


127 


1797 


304 


63.9 : 83 aa 


E 


873 


127 


1799 


270 


89.7 : 58 aa 


E 


873 


127 


1800 


62 


97.4 : 38 aa 


D 


873 


127 


1801 


187


82.0 : 50 aa 


E 


874 


150


1461 


179 


100.0: 150 aa 


B 


875 


2Q9 


1511 


339 


98.9 : 188 aa 


D 


876


234 


1739 


615 


99.6 : 230 aa


B 


877 


395 


1830 


879 


99.7 : 379 aa 


D 


878 


103 


1623 


413 


97.0 : 99 aa 


D 


879 


265 


1809 


566 


89.8 : 265 aa 


B 


880 


364 


1576 


442 


98.6 : 346 aa 


D 



IV. Identification of Nucleic Acids Encoding Vaccine Components and Targets for Agents 
Effective Against H. pylori 

The disclosed H. pylori genome sequence includes segments that direct the
synthesis of ribonucleic acids and polypeptides, as well as origins of replication, promoters,
other types of regulatory sequences, and intergenic nucleic acids. The invention
encompasses nucleic acids encoding immunogenic components of vaccines and targets for
agents effective against H. pylori. Identification of said immunogenic components
involves the determination of the function of the disclosed sequences, which can be
achieved using a variety of approaches. Non-limiting examples of these approaches are
described briefly below.

Homology to known sequences: Computer-assisted comparison of the disclosed H.
pylori sequences with previously reported sequences present in publicly available databases
is useful for identifying functional H. pylori nucleic acid and polypeptide sequences. It will
be understood that protein-coding sequences, for example, may be compared as a whole,
and that a high degree of sequence homology between two proteins (such as, for example,
>80-90%) at the amino acid level indicates that the two proteins also possess some degree
of functional homology, such as, for example, among enzymes involved in metabolism,
DNA synthesis, or cell wall synthesis, and proteins involved in transport, cell division, etc.
In addition, many structural features of particular protein classes have been identified and
correlate with specific consensus sequences, such as, for example, binding domains for
nucleotides, DNA, metal ions, and other small molecules; sites for covalent modifications
such as phosphorylation, acylation, and the like; sites of protein:protein interactions, etc.
These consensus sequences may be quite short and thus may represent only a fraction of the
entire protein-coding sequence. Identification of such a feature in an H. pylori sequence is
therefore useful in determining the function of the encoded protein and identifying useful
targets of antibacterial drugs.
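
As an illustration of scanning for a short consensus sequence, the following minimal sketch
searches a translated ORF for the widely described P-loop (Walker A) nucleotide-binding
consensus, [AG]-x(4)-G-K-[ST]. The motif is chosen only as a familiar example; the function
name and the ORF sequence are hypothetical and are not taken from the Sequence Listing.

```python
import re

# P-loop (Walker A) consensus written as a regular expression: [AG]-x(4)-G-K-[ST].
# The lookahead lets overlapping occurrences be reported as well.
P_LOOP = re.compile(r"(?=([AG].{4}GK[ST]))")


def find_motif(protein, pattern=P_LOOP):
    """Return (0-based start, matched text) for every occurrence of the motif."""
    return [(m.start(), m.group(1)) for m in pattern.finditer(protein.upper())]


# Hypothetical ORF translation, invented for illustration only.
orf = "MKLGPPGSGKSTLAR"
hits = find_motif(orf)
print(hits)
```

A hit of this kind only suggests a possible function; it would normally be weighed together
with whole-sequence homology before a protein is treated as a candidate drug target.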

Of particular relevance to the present invention are structural features that are
common to secretory, transmembrane, and surface proteins, including secretion signal
peptides and hydrophobic transmembrane domains. H. pylori proteins identified as
containing putative signal sequences and/or transmembrane domains are useful as
immunogenic components of vaccines.

Identification of essential genes: Nucleic acids that encode proteins essential for
growth or viability of H. pylori are preferred drug targets. H. pylori genes can be tested for
their biological relevance to the organism by examining the effect of deleting and/or
disrupting the genes, i.e., by so-called gene "knockout", using techniques known to those
skilled in the relevant art. In this manner, essential genes may be identified.

Strain-specific sequences: Because of the evolutionary relationship between
different H. pylori strains, it is believed that the presently disclosed H. pylori sequences are
useful for identifying, and/or discriminating between, previously known and new H. pylori
strains. It is believed that other H. pylori strains will exhibit at least 70% sequence
homology with the presently disclosed sequence. Systematic and routine analyses of DNA
sequences derived from samples containing H. pylori strains, and comparison with the
present sequence, allow for the identification of sequences that can be used to discriminate
between strains, as well as those that are common to all H. pylori strains. In one
embodiment, the invention provides nucleic acids, including probes, and peptide and
polypeptide sequences that discriminate between different strains of H. pylori.
Strain-specific components can also be identified functionally by their ability to elicit or
react with antibodies that selectively recognize one or more H. pylori strains.
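
The 70% figure above can be checked on aligned sequences with a simple identity count. The
sketch below assumes the two sequences have already been aligned to equal length by some
external tool, treats "homology" as percent identity over ungapped columns, and uses a
hypothetical function name; it is an illustration, not the comparison method of the
disclosure.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of identical positions between two pre-aligned sequences.

    Columns in which either sequence carries a gap ('-') are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a != "-" and b != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


# Invented fragments standing in for a disclosed sequence and a new isolate.
reference = "ACGTACGTAC"
isolate = "ACGTACGTTT"
identity = percent_identity(reference, isolate)
print(identity, identity >= 70.0)
```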

In another embodiment, the invention provides nucleic acids, including probes, and 
peptide and polypeptide sequences that are common to all H. pylori strains but are not 
found in other bacterial species. 

Specific Example: Determination Of Candidate Protein Antigens For Antibody And
Vaccine Development

The selection of candidate protein antigens for vaccine development can be derived
from the nucleic acids encoding H. pylori polypeptides. First, the ORF's can be analyzed
for homology to other known exported or membrane proteins and analyzed using the
discriminant analysis described by Klein, et al. (Klein, P., Kanehisa, M., and DeLisi, C.
(1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported and membrane
proteins.

Homology searches can be performed using the BLAST algorithm contained in the
Wisconsin Sequence Analysis Package (Genetics Computer Group, University Research
Park, 575 Science Drive, Madison, WI 53711) to compare each predicted ORF amino acid
sequence with all sequences found in the current GenBank, SWISS-PROT and PIR
databases. BLAST searches for local alignments between the ORF and the databank
sequences and reports a probability score which indicates the probability of finding this
sequence by chance in the database. ORF's with significant homology (e.g., probabilities
lower than 1x10^-6 that the homology is only due to random chance) to membrane or
exported proteins represent protein antigens for vaccine development. Possible functions
can be assigned to H. pylori genes based on sequence homology to genes cloned in other
organisms.
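
The selection rule described above (probability below 1x10^-6 and a match to a membrane or
exported protein) can be sketched as a simple filter. The record layout, the keyword list,
and the ORF identifiers below are assumptions made for illustration; they do not reflect
the output format of any particular BLAST program.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """One database match for an ORF (hypothetical record layout)."""
    orf_id: str
    subject: str        # description of the matched database entry
    probability: float  # chance probability reported for the match


# Assumed keywords marking membrane or exported proteins in hit descriptions.
KEYWORDS = ("membrane", "exported", "secreted")


def candidate_antigens(hits, max_probability=1e-6, keywords=KEYWORDS):
    """Return sorted ORF ids with a significant hit to a membrane/exported protein."""
    return sorted(
        {
            h.orf_id
            for h in hits
            if h.probability < max_probability
            and any(k in h.subject.lower() for k in keywords)
        }
    )


# Invented example hits.
hits = [
    Hit("orf_001", "outer membrane protein", 1e-20),
    Hit("orf_002", "outer membrane protein", 1e-3),   # not significant
    Hit("orf_003", "DNA polymerase III subunit", 1e-40),  # not membrane/exported
    Hit("orf_004", "Secreted protease", 5e-9),
]
print(candidate_antigens(hits))
```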

Discriminant analysis (Klein, et al. supra) can be used to examine the ORF amino
acid sequences. This algorithm uses the intrinsic information contained in the ORF amino
acid sequence and compares it to information derived from the properties of known
membrane and exported proteins. This comparison predicts which proteins will be
exported, membrane associated or cytoplasmic. ORF amino acid sequences identified as
exported or membrane associated by this algorithm are likely protein antigens for vaccine
development.
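
The discriminant function of Klein et al. is not reproduced here. As a simpler stand-in that
conveys the idea of using intrinsic sequence information, the sketch below computes a
sliding-window mean of the Kyte-Doolittle hydropathy scale and flags sequences containing a
strongly hydrophobic stretch. The window length of 19 and the cutoff of 1.6 are commonly
quoted heuristics and are assumptions of this sketch, as are the function names.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_means(protein, window=19):
    """Mean hydropathy of every full window; unknown residues (e.g. X) score 0."""
    scores = [KD.get(aa, 0.0) for aa in protein.upper()]
    if len(scores) < window:
        return []
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def has_hydrophobic_segment(protein, window=19, threshold=1.6):
    """True if any window is hydrophobic enough to suggest a membrane-spanning stretch."""
    return any(mean >= threshold for mean in window_means(protein, window))


# Invented sequences: a charged stretch with and without a hydrophobic core.
print(has_hydrophobic_segment("DEKR" * 3 + "L" * 19 + "DEKR" * 3))
print(has_hydrophobic_segment("DEKR" * 10))
```

A screen of this kind only separates candidates with a hydrophobic segment from those
without one; it does not distinguish exported from membrane-associated proteins.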

Infrequently it is not possible to distinguish between multiple possible nucleotides 
at a given position in the nucleic acid sequence. In those cases the ambiguities are denoted 
by an extended alphabet as follows: 

These are the official IUPAC-IUB single-letter base codes:

Code    Base Description

G       Guanine
A       Adenine
T       Thymine
C       Cytosine
R       Purine                (A or G)
Y       Pyrimidine            (C or T or U)
M       Amino                 (A or C)
K       Ketone                (G or T)
S       Strong interaction    (C or G)
W       Weak interaction      (A or T)
H       Not-G                 (A or C or T)
B       Not-A                 (C or G or T)
V       Not-T (not-U)         (A or C or G)
D       Not-C                 (A or G or T)
N       Any                   (A or C or G or T)


The amino acid translations of this invention account for the ambiguity in the
nucleic acid sequence by translating the ambiguous codon as the letter "X". In all cases, the
permissible amino acid residues at a position are clear from an examination of the nucleic
acid sequence based on the standard genetic code.
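
A minimal sketch of this convention follows: any codon containing an ambiguity code is
translated as "X", and a helper lists the residues that the codon could encode under the
standard genetic code. Only DNA letters are handled (U is not), and the function names are
illustrative assumptions.

```python
from itertools import product

# IUPAC-IUB codes expanded to the DNA bases they stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT",
    "S": "CG", "W": "AT", "H": "ACT", "B": "CGT",
    "V": "ACG", "D": "AGT", "N": "ACGT",
}

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def permissible_residues(codon):
    """Set of residues (or '*' for stop) an ambiguous codon could encode."""
    options = [IUPAC[base] for base in codon.upper()]
    return {CODON_TABLE["".join(bases)] for bases in product(*options)}


def translate(dna):
    """Translate in frame 1; any codon with an ambiguity code becomes 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


print(translate("ATGGCNTAA"))
print(permissible_residues("RAT"))
```

Note that an ambiguous codon is written as "X" even when, as with GCN, every possible
reading encodes the same residue; `permissible_residues` recovers that information.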

V. Production of Fragments and Analogs of H. pylori Nucleic Acids and Polypeptides

Based on the discovery of the H. pylori gene products of the invention provided in
the Sequence Listing, one skilled in the art can alter the disclosed structure (of H. pylori
genes), e.g., by producing fragments or analogs, and test the newly produced structures for
activity. Examples of techniques known to those skilled in the relevant art which allow the
production and testing of fragments and analogs are discussed below. These, or analogous
methods, can be used to make and screen libraries of polypeptides, e.g., libraries of random
peptides or libraries of fragments or analogs of cellular proteins, for the ability to bind H.
pylori polypeptides. Such screens are useful for the identification of inhibitors of H. pylori.

Generation of Fragments 

Fragments of a protein can be produced in several ways, e.g., recombinantly, by
proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a
polypeptide can be generated by removing one or more nucleotides from one end (for a
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes
the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments.
Digestion with "end-nibbling" endonucleases can thus generate DNA's which encode an
array of fragments. DNA's which encode fragments of a protein can also be generated by
random shearing, restriction digestion or a combination of the above-discussed methods.

Fragments can also be chemically synthesized using techniques known in the art
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example,
peptides of the present invention may be arbitrarily divided into fragments of desired length
with no overlap of the fragments, or divided into overlapping fragments of a desired length.
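
Dividing a peptide into non-overlapping or overlapping fragments of a desired length can be
sketched as follows. The function name and the `step` parameter are illustrative
assumptions; a step equal to the fragment length gives no overlap, and a smaller step gives
overlapping fragments.

```python
def fragments(peptide, length, step=None):
    """Split a peptide into fragments of `length`, advancing `step` residues each time.

    With step == length (the default) the fragments do not overlap; the final
    fragment may be shorter if the peptide length is not a multiple of `length`.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    step = length if step is None else step
    if step <= 0:
        raise ValueError("step must be positive")
    pieces = []
    for start in range(0, len(peptide), step):
        pieces.append(peptide[start:start + length])
        if start + length >= len(peptide):
            break  # the peptide's end has been reached
    return pieces


# Placeholder 10-residue string used only to show the windowing.
print(fragments("ABCDEFGHIJ", 4))          # no overlap
print(fragments("ABCDEFGHIJ", 4, step=2))  # overlap of two residues
```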

Alteration of Nucleic Acids and Polypeptides: Random Methods

Amino acid sequence variants of a protein can be prepared by random mutagenesis
of DNA which encodes a protein or a particular domain or region of a protein. Useful
methods include PCR mutagenesis and saturation mutagenesis. A library of random amino
acid sequence variants can also be generated by the synthesis of a set of degenerate
oligonucleotide sequences. (Methods for screening proteins in a library of variants are
described elsewhere herein).

(A) PCR Mutagenesis 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce random
mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). The
DNA region to be mutagenized is amplified using the polymerase chain reaction (PCR)
under conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerase, e.g.,
by using a dGTP/dATP ratio of five and adding Mn2+ to the PCR reaction. The pool of
amplified DNA fragments is inserted into appropriate cloning vectors to provide random
mutant libraries.

(B) Saturation Mutagenesis

Saturation mutagenesis allows for the rapid introduction of a large number of single
base substitutions into cloned DNA fragments (Myers et al., 1985, Science 229:242). This
technique includes generation of mutations, e.g., by chemical treatment or irradiation of
single-stranded DNA in vitro, and synthesis of a complementary DNA strand. The
mutation frequency can be modulated by modulating the severity of the treatment, and
essentially all possible base substitutions can be obtained. Because this procedure does not
involve a genetic selection for mutant fragments, both neutral substitutions and those
that alter function are obtained. The distribution of point mutations is not biased toward
conserved sequence elements.

(C) Degenerate Oligonucleotides

A library of homologs can also be generated from a set of degenerate
oligonucleotide sequences. Chemical synthesis of a degenerate sequence can be carried
out in an automatic DNA synthesizer, and the synthetic genes then ligated into an
appropriate expression vector. The synthesis of degenerate oligonucleotides is known in
the art (see, for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981)
Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton,
Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura
et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). Such techniques
have been employed in the directed evolution of other proteins (see, for example, Scott et
al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al.
(1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S.
Patents Nos. 5,223,409, 5,198,346, and 5,096,815).
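
The set of sequences represented by one degenerate oligonucleotide can be enumerated
directly from the IUPAC-IUB ambiguity codes. The sketch below is illustrative only; the
function names and the example oligonucleotides are assumptions, and U is not handled.

```python
from itertools import product
from math import prod

# IUPAC-IUB codes expanded to the DNA bases they stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT",
    "S": "CG", "W": "AT", "H": "ACT", "B": "CGT",
    "V": "ACG", "D": "AGT", "N": "ACGT",
}


def expand(oligo):
    """List every unambiguous sequence encoded by a degenerate oligonucleotide."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in oligo.upper()))]


def library_size(oligo):
    """Number of distinct sequences, computed without enumerating them."""
    return prod(len(IUPAC[b]) for b in oligo.upper())


print(expand("ARY"))
print(library_size("NNN" * 6))  # six fully randomized codons
```

Counting before enumerating is useful because the library grows multiplicatively with each
degenerate position.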

Alteration of Nucleic Acids and Polypeptides: Methods for Directed Mutagenesis

Non-random, or directed, mutagenesis techniques can be used to provide specific
sequences or mutations in specific regions. These techniques can be used to create variants
which include, e.g., deletions, insertions, or substitutions, of residues of the known amino
acid sequence of a protein. The sites for mutation can be modified individually or in series,
e.g., by (1) substituting first with conserved amino acids and then with more radical choices
depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of
the same or a different class adjacent to the located site, or combinations of options 1-3.

(A) Alanine Scanning Mutagenesis

Alanine scanning mutagenesis is a useful method for identification of certain
residues or regions of the desired protein that are preferred locations or domains for
mutagenesis (Cunningham and Wells, Science 244:1081-1085, 1989). In alanine scanning,
a residue or group of target residues are identified (e.g., charged residues such as Arg, Asp,
His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most
preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction
of the amino acids with the surrounding aqueous environment in or outside the cell. Those
domains demonstrating functional sensitivity to the substitutions are then refined by
introducing further or other variants at or for the sites of substitution. Thus, while the site
for introducing an amino acid sequence variation is predetermined, the nature of the
mutation per se need not be predetermined. For example, to optimize the performance of a
mutation at a given site, alanine scanning or random mutagenesis may be conducted at the
target codon or region and the expressed desired protein subunit variants are screened for
the optimal combination of desired activity.
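
The single-site form of alanine scanning can be sketched as a routine that replaces each
charged residue (Arg, Asp, His, Lys, Glu) in turn with alanine and names the variant in the
usual residue-position-residue style. The function name and the example peptide are
illustrative assumptions.

```python
# One-letter codes for the charged residues named in the text.
CHARGED = frozenset("RDHKE")


def alanine_scan(protein, targets=CHARGED):
    """Return (variant name, variant sequence) for each single alanine substitution."""
    variants = []
    for i, residue in enumerate(protein):
        if residue in targets and residue != "A":
            name = f"{residue}{i + 1}A"  # e.g. K2A: Lys at position 2 to Ala
            variants.append((name, protein[:i] + "A" + protein[i + 1:]))
    return variants


# Invented five-residue peptide.
for name, sequence in alanine_scan("MKDLE"):
    print(name, sequence)
```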

(B) Oligonucleotide-Mediated Mutagenesis 

Oligonucleotide-mediated mutagenesis is a useful method for preparing
substitution, deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 2:183,
1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a
mutation to a DNA template, where the template is the single-stranded form of a plasmid or
bacteriophage containing the unaltered or native DNA sequence of the desired protein.
After hybridization, a DNA polymerase is used to synthesize an entire second
complementary strand of the template that will thus incorporate the oligonucleotide primer,
and will code for the selected alteration in the desired protein DNA. Generally,
oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide
will have 12 to 15 nucleotides that are completely complementary to the template on either
side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will
hybridize properly to the single-stranded DNA template molecule. The oligonucleotides
are readily synthesized using techniques known in the art such as that described by Crea et
al. (Proc. Natl. Acad. Sci. USA, 75:5765 [1978]).
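
The length rule above (12 to 15 matching nucleotides on either side of the change, giving at
least 25 nucleotides for a single-base substitution) can be sketched as follows. The
function names are hypothetical. The sequence is built in the sense of the strand supplied;
the reverse complement is provided because the oligonucleotide actually synthesized must be
complementary to whichever strand the single-stranded template carries.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna):
    """Reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def design_oligo(sequence, position, new_base, flank=12):
    """Window around a single-base substitution with `flank` matching bases per side.

    `position` is 0-based. Raises ValueError if the sequence is too short to
    supply the requested flanks.
    """
    if position - flank < 0 or position + flank + 1 > len(sequence):
        raise ValueError("not enough flanking sequence around the mutation site")
    left = sequence[position - flank:position]
    right = sequence[position + 1:position + flank + 1]
    return left + new_base + right


# Invented 32-nt sequence; substitute G at 0-based position 15.
template = "ACGT" * 8
oligo = design_oligo(template, 15, "G")
print(oligo, len(oligo))
print(reverse_complement(oligo))
```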

(C) Cassette Mutagenesis 

Another method for preparing variants, cassette mutagenesis, is based on the
technique described by Wells et al. (Gene, 34:315 [1985]). The starting material is a
plasmid (or other vector) which includes the protein subunit DNA to be mutated. The
codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique
restriction endonuclease site on each side of the identified mutation site(s). If no such
restriction sites exist, they may be generated using the above-described
oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations in
the desired protein subunit DNA. After the restriction sites have been introduced into the
plasmid, the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide
encoding the sequence of the DNA between the restriction sites but containing the desired
mutation(s) is synthesized using standard procedures. The two strands are synthesized
separately and then hybridized together using standard techniques. This double-stranded
oligonucleotide is referred to as the cassette. This cassette is designed to have 3' and 5'
ends that are compatible with the ends of the linearized plasmid, such that it can be
directly ligated to the plasmid. This plasmid now contains the mutated desired protein
subunit DNA sequence.

(D) Combinatorial Mutagenesis

Combinatorial mutagenesis can also be used to generate mutants (Ladner et al., WO
88/06630). In this method, the amino acid sequences for a group of homologs or other
related proteins are aligned, preferably to promote the highest homology possible. All of
the amino acids which appear at a given position of the aligned sequences can be selected
to create a degenerate set of combinatorial sequences. The variegated library of variants is
generated by combinatorial mutagenesis at the nucleic acid level, and is encoded by a
variegated gene library. For example, a mixture of synthetic oligonucleotides can be
enzymatically ligated into gene sequences such that the degenerate set of potential
sequences are expressible as individual peptides, or alternatively, as a set of larger fusion
proteins containing the set of degenerate sequences.
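
The construction of a degenerate set from aligned homologs can be sketched at the amino acid
level: collect the residues seen at each aligned position, then take every combination. The
sketch assumes gap-free sequences of equal length and uses hypothetical function names; it
does not model the nucleic acid level synthesis itself.

```python
from itertools import product


def position_sets(aligned):
    """Sorted residues observed at each position of equal-length aligned sequences."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("aligned sequences must be non-empty and of equal length")
    return [sorted(set(column)) for column in zip(*aligned)]


def combinatorial_library(aligned):
    """Every sequence obtainable by choosing one observed residue per position."""
    return ["".join(choice) for choice in product(*position_sets(aligned))]


# Three invented tripeptide "homologs".
homologs = ["MKV", "MRV", "MKI"]
print(position_sets(homologs))
print(combinatorial_library(homologs))
```

The resulting set contains combinations, such as MRI here, that are absent from the input
alignment, which is the point of the combinatorial approach.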

Other Modifications of H. pylori Nucleic Acids and Polypeptides

It is possible to modify the structure of an H. pylori polypeptide for such purposes
as increasing solubility or enhancing stability (e.g., shelf life ex vivo and resistance to
proteolytic degradation in vivo). A modified H. pylori protein or peptide can be produced
in which the amino acid sequence has been altered, such as by amino acid substitution,
deletion, or addition as described herein.

An H. pylori peptide can also be modified by substitution of cysteine residues,
preferably with alanine, serine, threonine, leucine or glutamic acid residues, to minimize
dimerization via disulfide linkages. In addition, amino acid side chains of fragments of the
protein of the invention can be chemically modified. Another modification is cyclization of
the peptide.

In order to enhance stability and/or reactivity, an H. pylori polypeptide can be
modified to incorporate one or more polymorphisms in the amino acid sequence of the
protein resulting from any natural allelic variation. Additionally, D-amino acids,
non-natural amino acids, or non-amino acid analogs can be substituted or added to produce a
modified protein within the scope of this invention. Furthermore, an H. pylori polypeptide
can be modified using polyethylene glycol (PEG) according to the method of A. Sehon and
co-workers (Wie et al., supra) to produce a protein conjugated with PEG. In addition, PEG
can be added during chemical synthesis of the protein. Other modifications of H. pylori
proteins include reduction/alkylation (Tarr, Methods of Protein Microcharacterization, J. E.
Silver ed., Humana Press, Clifton NJ 155-194 (1986)); acylation (Tarr, supra); chemical
coupling to an appropriate carrier (Mishell and Shiigi, eds, Selected Methods in Cellular
Immunology, WH Freeman, San Francisco, CA (1980), U.S. Patent 4,939,239); or mild
formalin treatment (Marsh, (1971) Int. Arch. of Allergy and Appl. Immunol., 41: 199 - 215).

To facilitate purification and potentially increase solubility of an H. pylori protein
or peptide, it is possible to add an amino acid fusion moiety to the peptide backbone. For
example, hexa-histidine can be added to the protein for purification by immobilized metal
ion affinity chromatography (Hochuli, E. et al., (1988) Bio/Technology, 6: 1321 - 1325). In
addition, to facilitate isolation of peptides free of irrelevant sequences, specific
endoprotease cleavage sites can be introduced between the sequences of the fusion moiety
and the peptide.

To potentially aid proper antigen processing of epitopes within an H. pylori
polypeptide, canonical protease sensitive sites can be engineered between regions, each
comprising at least one epitope, via recombinant or synthetic methods. For example,
charged amino acid pairs, such as KK or RR, can be introduced between regions within a
protein or fragment during recombinant construction thereof. The resulting peptide can be
rendered sensitive to cleavage by cathepsin and/or other trypsin-like enzymes which would
generate portions of the protein containing one or more epitopes. In addition, such charged
amino acid residues can result in an increase in the solubility of the peptide.

Primary Methods for Screening Polypeptides and Analogs 

Various techniques are known in the art for screening generated mutant gene
products. Techniques for screening large gene libraries often include cloning the gene
library into replicable expression vectors, transforming appropriate cells with the resulting
library of vectors, and expressing the genes under conditions in which detection of a
desired activity, e.g., in this case, binding to H. pylori polypeptide or an interacting protein,
facilitates relatively easy isolation of the vector encoding the gene whose product was
detected. Each of the techniques described below is amenable to high through-put analysis
for screening large numbers of sequences created, e.g., by random mutagenesis techniques.

(A) Two Hybrid Systems

Two hybrid assays such as the system described above (as with the other screening
methods described herein), can be used to identify polypeptides, e.g., fragments or analogs
of a naturally-occurring H. pylori polypeptide, e.g., of cellular proteins, or of randomly
generated polypeptides which bind to an H. pylori protein. (The H. pylori domain is used
as the bait protein and the library of variants is expressed as fish fusion proteins.) In an
analogous fashion, a two hybrid assay (as with the other screening methods described
herein), can be used to find polypeptides which bind an H. pylori polypeptide.

(B) Display Libraries

In one approach to screening assays, the candidate peptides are displayed on the
surface of a cell or viral particle, and the ability of particular cells or viral particles to bind
an appropriate receptor protein via the displayed product is detected in a "panning assay".
For example, the gene library can be cloned into the gene for a surface membrane protein of
a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO
88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS
18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for
potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can
be used to detect homologs which retain ligand-binding activity. The use of fluorescently
labeled ligands allows cells to be visually inspected and separated under a fluorescence
microscope, or, where the morphology of the cell permits, to be separated by a
fluorescence-activated cell sorter.

A gene library can be expressed as a fusion protein on the surface of a viral particle.
For instance, in the filamentous phage system, foreign peptide sequences can be expressed
on the surface of infectious phage, thereby conferring two significant benefits. First, since
these phage can be applied to affinity matrices at concentrations well over 10^13 phage per
milliliter, a large number of phage can be screened at one time. Second, since each
infectious phage displays a gene product on its surface, if a particular phage is recovered
from an affinity matrix in low yield, the phage can be amplified by another round of
infection. The group of almost identical E. coli filamentous phages M13, fd, and f1 are
most often used in phage display libraries. Either of the phage gIII or gVIII coat proteins
can be used to generate fusion proteins without disrupting the ultimate packaging of the
viral particle. Foreign epitopes can be expressed at the NH2-terminal end of pIII and phage
bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner
et al. PCT publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks
et al. (1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J. 12:725-734;
Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461).

A common approach uses the maltose receptor of E. coli (the outer membrane
protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037).
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce
peptides fused into one of the extracellular loops of the protein. These peptides are
available for binding to ligands, e.g., to antibodies, and can elicit an immune response when
the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al.
(1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 37-45), and
PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial surface
structures have served as vehicles for peptide display. Peptides can be fused to pilin, a
protein which polymerizes to form the pilus, a conduit for interbacterial exchange of genetic
information (Thiry et al. (1989) Appl. Environ. Microbiol. 55, 984-993). Because of its
role in interacting with other cells, the pilus provides a useful support for the presentation
of peptides to the extracellular environment. Another large surface structure used for
peptide display is the bacterial motive organ, the flagellum. Fusion of peptides to the
subunit protein flagellin offers a dense array of many peptide copies on the host cells
(Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of other bacterial
species have also served as peptide fusion partners. Examples include the Staphylococcus
protein A and the outer membrane IgA protease of Neisseria (Hansson et al. (1992) J.
Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999).

In the filamentous phage systems and the LamB system described above, the
physical link between the peptide and its encoding DNA occurs by the containment of the
DNA within a particle (cell or phage) that carries the peptide on its surface. Capturing the
peptide captures the particle and the DNA within. An alternative scheme uses the DNA-binding
protein LacI to form a link between peptide and DNA (Cull et al. (1992) Proc. Natl. Acad.
Sci. U.S.A. 89:1865-1869). This system uses a plasmid containing the LacI gene with an
oligonucleotide cloning site at its 3'-end. Under controlled induction by arabinose, a
LacI-peptide fusion protein is produced. This fusion retains the natural ability of LacI to
bind to a short DNA sequence known as the LacO operator (LacO). By installing two copies
of LacO on the expression plasmid, the LacI-peptide fusion binds tightly to the plasmid that
encoded it. Because the plasmids in each cell contain only a single oligonucleotide
sequence and each cell expresses only a single peptide sequence, the peptides become
specifically and stably associated with the DNA sequence that directed their synthesis. The
cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix
of immobilized receptor to recover the complexes containing active peptides. The
associated plasmid DNA is then reintroduced into cells for amplification and DNA
sequencing to determine the identity of the peptide ligands. As a demonstration of the
practical utility of the method, a large random library of dodecapeptides was made and
selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort
of peptides was recovered, all related by a consensus sequence corresponding to a six-residue
portion of dynorphin B (Cull et al. (1992) Proc. Natl. Acad. Sci. U.S.A. 89:1865-1869).

This scheme, sometimes referred to as peptides-on-plasmids, differs in two
important ways from the phage display methods. First, the peptides are attached to the
C-terminus of the fusion protein, resulting in the display of the library members as peptides
having free carboxy termini. Both of the filamentous phage coat proteins, pIII and pVIII,
are anchored to the phage through their C-termini, and the guest peptides are placed into the
outward-extending N-terminal domains. In some designs, the phage-displayed peptides are
presented right at the amino terminus of the fusion protein (Cwirla et al. (1990) Proc.
Natl. Acad. Sci. U.S.A. 87, 6378-6382). A second difference is the set of biological biases
affecting the population of peptides actually present in the libraries. The LacI fusion
molecules are confined to the cytoplasm of the host cells. The phage coat fusions are
exposed briefly to the cytoplasm during translation but are rapidly secreted through the
inner membrane into the periplasmic compartment, remaining anchored in the membrane
by their C-terminal hydrophobic domains, with the N-termini, containing the peptides,
protruding into the periplasm while awaiting assembly into phage particles. The peptides
in the LacI and phage libraries may differ significantly as a result of their exposure to
different proteolytic activities. The phage coat proteins require transport across the inner
membrane and signal peptidase processing as a prelude to incorporation into phage.
Certain peptides exert a deleterious effect on these processes and are underrepresented in
the libraries (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251). These particular biases
are not a factor in the LacI display system.

The number of small peptides available in recombinant random libraries is
enormous. Libraries of lO^-lO^ independent clones are routinely prepared. Libraries as
large as 1 0* * recombinants have been created, but this size approaches the practical limit
for clone libraries. This limitation in library size occurs at the step of transforming the
DNA containing randomized segments into the host bacterial cells. To circumvent this
limitation, an in vitro system based on the display of nascent peptides in polysome
complexes has recently been developed. This display library method has the potential of
producing libraries 3-6 orders of magnitude larger than the currently available
phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries,
expression of the peptides, and screening are done in an entirely cell-free format.
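
The practical weight of the library-size limit can be illustrated by comparing a
library with the theoretical sequence space of a fully random peptide. The following is a
minimal sketch only; it assumes the 20 standard amino acids at every position, and the
function names and the example library size are hypothetical, not taken from the text.

```python
# Theoretical sequence space of a fully random peptide library.
# Assumption: each position may hold any of the 20 standard amino acids.
AMINO_ACIDS = 20


def sequence_space(length: int) -> int:
    """Number of distinct peptides of the given length."""
    return AMINO_ACIDS ** length


def fraction_sampled(library_size: int, length: int) -> float:
    """Upper bound on the fraction of sequence space a library can cover."""
    return min(1.0, library_size / sequence_space(length))


if __name__ == "__main__":
    for n in (6, 10, 12):
        print(f"{n}-mers: {sequence_space(n):.3e} possible sequences")
    # Hypothetical library of one billion independent clones of dodecapeptides.
    print(f"coverage: {fraction_sampled(10**9, 12):.2e}")
```

Because the space grows as a power of the peptide length, every increase in attainable
library size translates directly into broader coverage of longer peptides.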

In one application of this method (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251),
a molecular DNA library encoding 10*2 decapeptides was constructed and the
library expressed in an E. coli S30 in vitro coupled transcription/translation system.
Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a
substantial proportion of the RNA in polysomes and yielding complexes containing nascent
peptides still linked to their encoding RNA. The polysomes are sufficiently robust to be
affinity purified on immobilized receptors in much the same way as the more conventional
recombinant peptide display libraries are screened. RNA from the bound complexes is
recovered, converted to cDNA, and amplified by PCR to produce a template for the next
round of synthesis and screening. The polysome display method can be coupled to the
phage display system. Following several rounds of screening, cDNA from the enriched
pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide
expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing
vector for peptide identification. By expressing the polysome-derived peptides on phage,
one can either continue the affinity selection procedure in this format or assay the peptides
on individual clones for binding activity in a phage ELISA, or for binding specificity in a
competition phage ELISA (Barrett et al. (1992) Anal. Biochem. 204, 357-364). To identify
the sequences of the active peptides, one sequences the DNA produced by the phagemid
host.

Secondary Screening of Polypeptides and Analogs

The high-throughput assays described above can be followed by secondary screens
in order to identify further biological activities which will, e.g., allow one skilled in the art
to differentiate agonists from antagonists. The type of secondary screen used will depend
on the desired activity that needs to be tested. For example, an assay can be developed in
which the ability to inhibit an interaction between a protein of interest and its respective
ligand can be used to identify antagonists from a group of peptide fragments isolated
through one of the primary screens described above.

Therefore, methods for generating fragments and analogs and testing them for
activity are known in the art. Once the core sequence of interest is identified, it is routine
for one skilled in the art to obtain analogs and fragments.

Peptide Mimetics of H. pylori Polypeptides

The invention also provides for reduction of the protein binding domains of the
subject H. pylori polypeptides to generate mimetics, e.g., peptide or non-peptide agents.
The peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand, e.g.,
in the case of an H. pylori polypeptide binding to a naturally occurring ligand. The critical
residues of a subject H. pylori polypeptide which are involved in molecular recognition of a
polypeptide can be determined and used to generate H. pylori-derived peptidomimetics
which competitively or noncompetitively inhibit binding of the H. pylori polypeptide with
an interacting polypeptide (see, for example, European patent applications EP-412,762A
and EP-B31,080A).

For example, scanning mutagenesis can be used to map the amino acid residues of a
particular H. pylori polypeptide involved in binding an interacting polypeptide.
Peptidomimetic compounds (e.g., diazepine or isoquinoline derivatives) can then be generated
which mimic those residues in binding to an interacting polypeptide, and which therefore
can inhibit binding of an H. pylori polypeptide to an interacting polypeptide and thereby
interfere with the function of the H. pylori polypeptide. For instance, non-hydrolyzable peptide
analogs of such residues can be generated using benzodiazepine (e.g., see Freidinger et al.
in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden,
Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and Biology,
G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma lactam
rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM
Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al.
(1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function
(Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, IL,
1985), β-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al.
(1986) J Chem Soc Perkin Trans 1:1231), and β-aminoalcohols (Gordon et al. (1985)
Biochem Biophys Res Commun 126:419; and Dann et al. (1986) Biochem Biophys Res
Commun 134:71).

VI. Vaccine Formulations for H. pylori Nucleic Acids and Polypeptides

This invention also features vaccine compositions for protection against infection
by H. pylori, a gram-negative spiral microaerophilic bacterium, or for treatment of H. pylori
infection. In one embodiment, the vaccine compositions contain one or more
immunogenic components such as a surface protein from H. pylori, or portion thereof, and
a pharmaceutically acceptable carrier. Nucleic acids within the scope of the invention are
exemplified by the nucleic acids of the invention contained in the Sequence Listing which
encode H. pylori surface proteins. For example, the preferred nucleic acid for a vaccine
composition of the invention is isolated from the group of nucleic acids which encode cell
envelope proteins as outlined in Table 1. More specifically, the amino acids of SEQ ID
NO:812, SEQ ID NO:820, SEQ ID NO:880, SEQ ID NO:658, SEQ ID NO:865, SEQ ID
NO:1729, SEQ ID NO:1861, or fragments thereof, can be used alone or in combination for
the formulation of vaccine compositions of the invention, as can their corresponding
nucleic acids of SEQ ID NO:977, SEQ ID NO:978, SEQ ID NO:994, SEQ ID NO:215,
SEQ ID NO:989, SEQ ID NO:1278, and SEQ ID NO:1410. However, any nucleic acid
encoding an immunogenic H. pylori protein, or portion thereof, which is capable of
expression in a cell, can be used in the present invention. These vaccines have therapeutic
and prophylactic utilities.

One aspect of the invention provides a vaccine composition for protection against
infection by H. pylori which contains at least one immunogenic fragment of an H. pylori
protein and a pharmaceutically acceptable carrier. Preferred fragments include peptides of
at least about 10 amino acid residues in length, preferably about 10-20 amino acid residues
in length, and more preferably about 12-16 amino acid residues in length.
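
As an illustration of the preferred fragment lengths, candidate peptides of 12-16
residues can be enumerated from a protein sequence with a sliding window. This is a
sketch under stated assumptions: the function name, the step size, and the 20-residue
example sequence are hypothetical and are not drawn from the Sequence Listing.

```python
from typing import Iterator, Tuple


def peptide_fragments(protein: str, min_len: int = 12, max_len: int = 16,
                      step: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (start, fragment) for every window of min_len..max_len residues."""
    for length in range(min_len, max_len + 1):
        for start in range(0, len(protein) - length + 1, step):
            yield start, protein[start:start + length]


# Hypothetical 20-residue sequence, for illustration only.
example = "MKTLLVAGSHQWERTYPDFN"
frags = list(peptide_fragments(example))

if __name__ == "__main__":
    print(len(frags), "candidate fragments")
    print(frags[0])
```

Each candidate would then be screened for immunogenicity; the enumeration itself says
nothing about which fragments carry epitopes.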

Immunogenic components of the invention can be obtained, for example, by
screening polypeptides recombinantly produced from the corresponding fragment of the
nucleic acid encoding the full-length H. pylori protein. In addition, fragments can be
chemically synthesized using techniques known in the art such as conventional Merrifield
solid phase f-Moc or t-Boc chemistry.

In one embodiment, immunogenic components are identified by the ability of the
peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for
example, T cell proliferation or cytokine secretion, are defined herein as comprising at least
one T cell epitope. T cell epitopes are believed to be involved in initiation and perpetuation
of the immune response to the protein allergen which is responsible for the clinical
symptoms of allergy. These T cell epitopes are thought to trigger early events at the level
of the T helper cell by binding to an appropriate HLA molecule on the surface of an antigen
presenting cell, thereby stimulating the T cell subpopulation with the relevant T cell
receptor for the epitope. These events lead to T cell proliferation, lymphokine secretion,
local inflammatory reactions, recruitment of additional immune cells to the site of
antigen/T cell interaction, and activation of the B cell cascade, leading to the production of
antibodies. A T cell epitope is the basic element, or smallest unit of recognition by a T cell
receptor, where the epitope comprises amino acids essential to receptor recognition (e.g.,
approximately 6 or 7 amino acid residues). Amino acid sequences which mimic those of
the T cell epitopes are within the scope of this invention.

Screening immunogenic components can be accomplished using one or more of
several different assays. For example, in vitro, peptide T cell stimulatory activity is
assayed by contacting a peptide known or suspected of being immunogenic with an antigen
presenting cell which presents appropriate MHC molecules in a T cell culture. Presentation
of an immunogenic H. pylori peptide in association with appropriate MHC molecules to T
cells in conjunction with the necessary costimulation has the effect of transmitting a signal
to the T cell that induces the production of increased levels of cytokines, particularly of
interleukin-2 and interleukin-4. The culture supernatant can be obtained and assayed for
interleukin-2 or other known cytokines. For example, any one of several conventional
assays for interleukin-2 can be employed, such as the assay described in Proc. Natl. Acad.
Sci. USA, 86:1333 (1989), the pertinent portions of which are incorporated herein by
reference. A kit for an assay for the production of interferon is also available from
Genzyme Corporation (Cambridge, MA).

Alternatively, a common assay for T cell proliferation entails measuring tritiated
thymidine incorporation. The proliferation of T cells can be measured in vitro by
determining the amount of 3H-labeled thymidine incorporated into the replicating DNA of
cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell division
can be quantified.
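
Thymidine incorporation data of this kind are often summarized as a stimulation index,
the ratio of counts in peptide-stimulated wells to counts in unstimulated control wells.
The sketch below assumes that convention, which is not stated in the text; the function
name and the triplicate counts per minute are hypothetical.

```python
from statistics import mean


def stimulation_index(stimulated_cpm, control_cpm):
    """Mean cpm of peptide-stimulated wells divided by mean cpm of control wells."""
    background = mean(control_cpm)
    if background <= 0:
        raise ValueError("control counts must be positive")
    return mean(stimulated_cpm) / background


# Hypothetical triplicate counts per minute, for illustration only.
si = stimulation_index([4200, 3900, 4500], [1000, 1100, 900])

if __name__ == "__main__":
    print(f"stimulation index: {si:.2f}")
```

The cutoff above which a peptide is scored as stimulatory is a laboratory choice and is
deliberately left out of the sketch.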

Vaccine compositions of the invention containing immunogenic components (e.g.,
H. pylori polypeptide or fragment thereof or nucleic acid encoding an H. pylori polypeptide
or fragment thereof) preferably include a pharmaceutically acceptable carrier. The term
"pharmaceutically acceptable carrier" refers to a carrier that does not cause an allergic
reaction or other untoward effect in patients to whom it is administered. Suitable
pharmaceutically acceptable carriers include, for example, one or more of water, saline,
phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as combinations
thereof. Pharmaceutically acceptable carriers may further comprise minor amounts of
auxiliary substances such as wetting or emulsifying agents, preservatives or buffers, which
enhance the shelf life or effectiveness of the antibody. For vaccines of the invention
containing H. pylori polypeptides, the polypeptide is coadministered with a suitable
adjuvant.

It will be apparent to those of skill in the art that the therapeutically effective
amount of DNA or protein of this invention will depend, inter alia, upon the administration
schedule, the unit dose of antibody administered, whether the protein or DNA is
administered in combination with other therapeutic agents, the immune status and health of
the patient, and the therapeutic activity of the particular protein or DNA.

Vaccine compositions are conventionally administered parenterally, e.g., by
injection, either subcutaneously or intramuscularly. Methods for intramuscular
immunization are described by Wolff et al. (1990) Science 247: 1465-1468 and by Sedegah
et al. (1994) Immunology 91: 9866-9870. Other modes of administration include oral and
pulmonary formulations, suppositories, and transdermal applications. Oral immunization is
preferred over parenteral methods for inducing protection against infection by H. pylori
(Czinn et al. (1993) Vaccine 11: 637-642). Oral formulations include such normally
employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch,
magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like.

The vaccine compositions of the invention can include an adjuvant, including, but
not limited to, aluminum hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP);
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP);
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine
(CGP 19835A, referred to as MTP-PE);
RIBI, which contains three components from bacteria, monophosphoryl lipid A, trehalose
dimycolate, and cell wall skeleton (MPL + TDM + CWS), in a 2% squalene/Tween 80
emulsion; and cholera toxin. Others which may be used are non-toxic derivatives of
cholera toxin, including its B subunit, and/or conjugates or genetically engineered fusions
of the H. pylori polypeptide with cholera toxin or its B subunit, procholeragenoid, fungal
polysaccharides, including schizophyllan, muramyl dipeptide, muramyl dipeptide
derivatives, phorbol esters, labile toxin of E. coli, non-H. pylori bacterial lysates, block
polymers or saponins.

Other suitable delivery methods include biodegradable microcapsules or immuno-stimulating
complexes (ISCOMs), cochleates, or liposomes, genetically engineered
attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like
particles, e.g., bluetongue. The amount of adjuvant employed will depend on the type of
adjuvant used. For example, when the mucosal adjuvant is cholera toxin, it is suitably used
in an amount of 5 µg to 50 µg, for example 10 µg to 35 µg. When used in the form of
microcapsules, the amount used will depend on the amount employed in the matrix of the
microcapsule to achieve the desired dosage. The determination of this amount is within the
skill of a person of ordinary skill in the art.

Carrier systems in humans may include enteric release capsules protecting the
antigen from the acidic environment of the stomach, and including H. pylori polypeptide in
an insoluble form as fusion proteins. Suitable carriers for the vaccines of the invention are
enteric coated capsules and polylactide-glycolide microspheres. Suitable diluents are 0.2 N
NaHCO3 and/or saline.

Vaccines of the invention can be administered as a primary prophylactic agent in
adults or in children, as a secondary prevention, after successful eradication of H. pylori in
an infected host, or as a therapeutic agent with the aim of inducing an immune response in a
susceptible host to prevent infection by H. pylori. The vaccines of the invention are
administered in amounts readily determined by persons of ordinary skill in the art. Thus,
for adults a suitable dosage will be in the range of 10 µg to 10 g, preferably 10 µg to 100
mg, for example 50 µg to 50 mg. A suitable dosage for adults will also be in the range of 5
µg to 500 mg. Similar dosage ranges will be applicable for children. Those skilled in the
art will recognize that the optimal dose may be more or less depending upon the patient's
body weight, disease, the route of administration, and other factors. Those skilled in the art
will also recognize that appropriate dosage levels can be obtained based on results with
known oral vaccines such as, for example, a vaccine based on an E. coli lysate (6 mg dose
daily up to total of 540 mg) and with an enterotoxigenic E. coli purified antigen (4 doses of
1 mg) (Schulman et al., J. Urol. 150:917-921 (1993); Boedeker et al., American
Gastroenterological Assoc. 999;A-222 (1993)). The number of doses will depend upon the
disease, the formulation, and efficacy data from clinical trials. Without intending any
limitation as to the course of treatment, the treatment can be administered over 3 to 8 doses
for a primary immunization schedule over 1 month (Boedeker, American
Gastroenterological Assoc. 888:A-222 (1993)).

In a preferred embodiment, a vaccine composition of the invention can be based on
a killed whole E. coli preparation with an immunogenic fragment of an H. pylori protein of
the invention expressed on its surface or it can be based on an E. coli lysate, wherein the
killed E. coli acts as a carrier or an adjuvant.

It will be apparent to those skilled in the art that some of the vaccine compositions
of the invention are useful only for preventing H. pylori infection, some are useful only for
treating H. pylori infection, and some are useful for both preventing and treating H. pylori
infection. In a preferred embodiment, the vaccine composition of the invention provides
protection against H. pylori infection by stimulating humoral and/or cell-mediated
immunity against H. pylori. It should be understood that amelioration of any of the
symptoms of H. pylori infection is a desirable clinical goal, including a lessening of the
dosage of medication used to treat H. pylori-caused disease, or an increase in the
production of antibodies in the serum or mucus of patients.

VII. Antibodies Reactive With H. pylori Polypeptides 

The invention also includes antibodies specifically reactive with the subject H.
pylori polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be
made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by
Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a
hamster or rabbit can be immunized with an immunogenic form of the peptide. Techniques
for conferring immunogenicity on a protein or peptide include conjugation to carriers or
other techniques well known in the art. An immunogenic portion of the subject H. pylori
polypeptide can be administered in the presence of adjuvant. The progress of immunization
can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or
other immunoassays can be used with the immunogen as antigen to assess the levels of
antibodies.

In a preferred embodiment, the subject antibodies are immunospecific for antigenic
determinants of the H. pylori polypeptides of the invention, e.g., antigenic determinants of a
polypeptide of the invention contained in the Sequence Listing, or a closely related human
or non-human mammalian homolog (e.g., 90% homologous, more preferably at least 95%
homologous). In yet a further preferred embodiment of the invention, the anti-H. pylori
antibodies do not substantially cross react (i.e., react specifically) with a protein which is,
for example, less than 80 percent homologous to a sequence of the invention contained in
the Sequence Listing. By "not substantially cross react", it is meant that the antibody has a
binding affinity for a non-homologous protein which is less than 10 percent, more
preferably less than 5 percent, and even more preferably less than 1 percent, of the binding
affinity for a protein of the invention contained in the Sequence Listing. In a most
preferred embodiment, there is no crossreactivity between bacterial and mammalian
antigens.
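
The cross-reactivity thresholds above can be expressed as a simple ratio test. The
following sketch assumes, purely for illustration, that binding affinity is reported as an
association constant (larger value means tighter binding); the function name, the tier
labels, and that choice of measure are assumptions and are not specified in the text.

```python
def cross_reactivity_tier(target_ka: float, off_target_ka: float) -> str:
    """Classify off-target binding as a percentage of binding to the target.

    Both arguments are association constants (assumed units: 1/M), so a
    larger value means tighter binding.
    """
    if target_ka <= 0 or off_target_ka < 0:
        raise ValueError("affinities must be non-negative and target positive")
    percent = 100.0 * off_target_ka / target_ka
    if percent < 1:
        return "less than 1 percent"
    if percent < 5:
        return "less than 5 percent"
    if percent < 10:
        return "less than 10 percent"
    return "substantial cross-reactivity"


if __name__ == "__main__":
    # Hypothetical affinities, for illustration only.
    print(cross_reactivity_tier(1e9, 3e7))
```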

The term antibody as used herein is intended to include fragments thereof which are
also specifically reactive with H. pylori polypeptides. Antibodies can be fragmented using
conventional techniques and the fragments screened for utility in the same manner as
described above for whole antibodies. For example, F(ab')2 fragments can be generated by
treating antibody with pepsin. The resulting F(ab')2 fragment can be treated to reduce
disulfide bridges to produce Fab' fragments. The antibody of the invention is further
intended to include bispecific and chimeric molecules having an anti-H. pylori portion.

Both monoclonal and polyclonal antibodies (Ab) directed against H. pylori
polypeptides or H. pylori polypeptide variants, and antibody fragments such as Fab' and
F(ab')2, can be used to block the action of H. pylori polypeptide and allow the study of the
role of a particular H. pylori polypeptide of the invention in aberrant or unwanted
intracellular signaling, as well as the normal cellular function of the H. pylori, e.g., by
microinjection of anti-H. pylori polypeptide antibodies of the present invention.

Antibodies which specifically bind H. pylori epitopes can also be used in
immunohistochemical staining of tissue samples in order to evaluate the abundance and
pattern of expression of H. pylori antigens. Anti-H. pylori polypeptide antibodies can be
used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate H.
pylori levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, the
ability to monitor H. pylori polypeptide levels in an individual can allow determination of
the efficacy of a given treatment regimen for an individual afflicted with such a disorder.
The level of an H. pylori polypeptide can be measured in cells found in bodily fluid, such
as in urine samples, or can be measured in tissue, such as produced by gastric biopsy.
Diagnostic assays using anti-H. pylori antibodies can include, for example, immunoassays
designed to aid in early diagnosis of H. pylori infections. The present invention can also be
used as a method of detecting antibodies contained in samples from individuals infected by
this bacterium using specific H. pylori antigens.

Another application of anti-H. pylori polypeptide antibodies of the invention is in
the immunological screening of cDNA libraries constructed in expression vectors such as
λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having coding
sequences inserted in the correct reading frame and orientation, can produce fusion
proteins. For instance, λgt11 will produce fusion proteins whose amino termini consist of
β-galactosidase amino acid sequences and whose carboxy termini consist of a foreign
polypeptide. Antigenic epitopes of a subject H. pylori polypeptide can then be detected
with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates
with anti-H. pylori polypeptide antibodies. Phage, scored by this assay, can then be
isolated from the infected plate. Thus, the presence of H. pylori gene homologs can be
detected and cloned from other species, and alternate isoforms (including splicing variants)
can be detected and cloned.

VIII. Kits Containing Nucleic Acids, Polypeptides or Antibodies of the Invention

The nucleic acid, polypeptides and antibodies of the invention can be combined
with other reagents and articles to form kits. Kits for diagnostic purposes typically
comprise the nucleic acid, polypeptides or antibodies in vials or other suitable vessels. Kits
typically comprise other reagents for performing hybridization reactions, polymerase chain
reactions (PCR), or for reconstitution of lyophilized components, such as aqueous media,
salts, buffers, and the like. Kits may also comprise reagents for sample processing such as
detergents, chaotropic salts and the like. Kits may also comprise immobilization means
such as particles, supports, wells, dipsticks and the like. Kits may also comprise labeling
means such as dyes, developing reagents, radioisotopes, fluorescent agents, luminescent or
chemiluminescent agents, enzymes, intercalating agents and the like. With the nucleic acid
and amino acid sequence information provided herein, individuals skilled in the art can readily
assemble kits to serve their particular purpose. Kits further can include instructions for use.

IX. Drug Screening Assays Using H. pylori Polypeptides

By making available purified and recombinant H. pylori polypeptides, the present
invention provides assays which can be used to screen for drugs which are either agonists
or antagonists of the normal cellular function, in this case, of the subject H. pylori
polypeptides, or of their role in intracellular signaling. Such inhibitors or potentiators may
be useful as new therapeutic agents to combat H. pylori infections in humans. A variety of
assay formats will suffice and, in light of the present inventions, will be comprehended by
the skilled artisan.

In many drug screening programs which test libraries of compounds and natural
extracts, high throughput assays are desirable in order to maximize the number of
compounds surveyed in a given period of time. Assays which are performed in cell-free
systems, such as may be derived with purified or semi-purified proteins, are often preferred
as "primary" screens in that they can be generated to permit rapid development and
relatively easy detection of an alteration in a molecular target which is mediated by a test
compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test
compound can be generally ignored in the in vitro system, the assay instead being focused
primarily on the effect of the drug on the molecular target as may be manifest in an
alteration of binding affinity with other proteins or change in enzymatic properties of the
molecular target. Accordingly, in an exemplary screening assay of the present invention,
the compound of interest is contacted with an isolated and purified H. pylori polypeptide.

Screening assays can be constructed in vitro with a purified H. pylori polypeptide or
fragment thereof, such as an H. pylori polypeptide having enzymatic activity, such that the
activity of the polypeptide produces a detectable reaction product. The efficacy of the
compound can be assessed by generating dose response curves from data obtained using
various concentrations of the test compound. Moreover, a control assay can also be
performed to provide a baseline for comparison. Suitable products include those with
distinctive absorption, fluorescence, or chemi-luminescence properties, for example,
because detection may be easily automated. A variety of synthetic or naturally occurring
compounds can be tested in the assay to identify those which inhibit or potentiate the
activity of the H. pylori polypeptide. Some of these active compounds may directly, or
with chemical alterations to promote membrane permeability or solubility, also inhibit or
potentiate the same activity (e.g., enzymatic activity) in whole, live H. pylori cells.
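
A dose response curve of this kind is commonly reduced to a single figure such as the
concentration giving 50% inhibition. The sketch below estimates that value by linear
interpolation on a log-concentration axis between the two measurements that bracket 50%
of the control activity. It is one simple approach among several, not a method prescribed
by the text, and the function name and data points are hypothetical.

```python
import math


def ic50_by_interpolation(concentrations, activities):
    """Estimate the concentration giving 50% of control activity.

    concentrations: ascending, positive test-compound concentrations.
    activities: enzyme activity at each concentration, as percent of the
    uninhibited control assay.
    Returns None when the data never cross 50%.
    """
    points = list(zip(concentrations, activities))
    for (c_lo, a_lo), (c_hi, a_hi) in zip(points, points[1:]):
        if a_lo >= 50 >= a_hi and a_lo != a_hi:
            frac = (a_lo - 50) / (a_lo - a_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical measurements (concentration in µM, activity in % of control).
ic50 = ic50_by_interpolation([0.1, 1, 10, 100], [95, 80, 20, 5])

if __name__ == "__main__":
    print(f"estimated IC50: {ic50:.2f} µM")
```

A fitted sigmoidal model would normally be preferred where enough data points are
available; interpolation is shown only because it needs no external libraries.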

EXEMPLIFICATION 

I. Cloning and Sequencing of H. pylori DNA

H. pylori chromosomal DNA was isolated according to a basic DNA protocol
outlined in Schleif R.F. and Wensink P.C., Practical Methods in Molecular Biology, p. 98,
Springer-Verlag, NY, 1981, with minor modifications. Briefly, cells were pelleted,
resuspended in TE (10 mM Tris, 1 mM EDTA, pH 7.6) and GES lysis buffer (5.1 M
guanidinium thiocyanate, 0.1 M EDTA, pH 8.0, 0.5% N-laurylsarcosine) was added.
The suspension was chilled and ammonium acetate (NH4Ac) was added to a final concentration
of 2.0 M. DNA was extracted, first with chloroform, then with phenol-chloroform, and
reextracted with chloroform. DNA was precipitated with isopropanol, washed twice with
70% EtOH, dried and resuspended in TE.

Following isolation, whole genomic H. pylori DNA was nebulized (Bodenteich et
al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994) to
a median size of 2000 bp. After nebulization, the DNA was concentrated and separated on
a standard 1% agarose gel. Several fractions, corresponding to approximate sizes 900-1300
bp, 1300-1700 bp, 1700-2200 bp, 2200-2700 bp, were excised from the gel and purified by
the GeneClean procedure (Bio101, Inc.).

The purified DNA fragments were then blunt-ended using T4 DNA polymerase.
The healed DNA was then ligated to unique BstXI-linker adapters (5'
TCTAGACCACCTGC and 5' GTGGTCTAGA in 100-1000 fold molar excess). These
linkers are complementary to the BstXI-cut pMPX vectors, while the overhang is not self-
complementary. Therefore, the linkers will not concatemerize nor will the cut vector
religate itself easily. The linker-adapted inserts were separated from the unincorporated
linkers on a 1% agarose gel and purified using GeneClean. The linker-adapted inserts were
then ligated to each of the 20 pMPX vectors to construct a series of "shotgun" subclone
libraries. The vectors contain an out-of-frame lacZ gene at the cloning site which becomes
in-frame in the event that an adapter-dimer is cloned, allowing these to be avoided by their
blue color.
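
The statement that the adapter overhang is not self-complementary can be checked directly from the two linker sequences given above. The following is a minimal sketch; the function names are hypothetical and the only inputs are the two oligonucleotides quoted in the text.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def single_stranded_overhang(long_oligo, short_oligo):
    """Return the unpaired part of long_oligo once short_oligo has annealed to it."""
    paired = reverse_complement(short_oligo)
    start = long_oligo.find(paired)
    if start < 0:
        raise ValueError("oligos do not anneal")
    # Everything outside the duplex region stays single-stranded
    return long_oligo[:start] + long_oligo[start + len(paired):]

def is_self_complementary(seq):
    """True if a sequence can base-pair with a second copy of itself."""
    return seq == reverse_complement(seq)

long_oligo = "TCTAGACCACCTGC"   # 14-mer linker from the text
short_oligo = "GTGGTCTAGA"      # 10-mer linker from the text

overhang = single_stranded_overhang(long_oligo, short_oligo)
overhang_pairs_with_itself = is_self_complementary(overhang)
```

The 10-mer pairs with the first ten bases of the 14-mer, leaving a four-base tail (CTGC) whose reverse complement (GCAG) differs from it, so two adapter ends cannot anneal to each other. A palindromic overhang such as the one left by EcoRI (AATT) would behave differently.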

All subsequent steps were based on the multiplex DNA sequencing protocols
outlined in Church G.M. and Kieffer-Higgins S., Science 240:185-188, 1988. Only major
modifications to the protocols are highlighted. Briefly, each of the 20 vectors was then
transformed into DH5α competent cells (Gibco/BRL, DH5α transformation protocol). The
libraries were assessed by plating onto antibiotic plates containing ampicillin, methicillin
and IPTG/Xgal. The plates were incubated overnight at 37°C. Successful transformants
were then used for plating of clones and pooling into the multiplex pools. The clones were
picked and pooled into 40 ml growth medium cultures. The cultures were grown overnight
at 37°C. DNA was purified using the Qiagen Midi-prep kits and Tip-100 columns
(Qiagen, Inc.). In this manner, 100 µg of DNA was obtained per pool. Fifteen 96-well
plates of DNA were generated to obtain a 5-10 fold sequence redundancy assuming 250-
300 base average read-lengths.
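
The relationship between read count, read length and sequence redundancy used in this kind of planning can be sketched as below. The genome size is not stated in this section, so the value of 1.7 Mb in the example is an assumption made purely for illustration, and the function names are hypothetical.

```python
import math

def fold_redundancy(n_reads, mean_read_length, genome_size):
    """Average number of times each base is covered by a read."""
    return n_reads * mean_read_length / genome_size

def reads_needed(target_redundancy, mean_read_length, genome_size):
    """Smallest number of reads that reaches the target redundancy."""
    return math.ceil(target_redundancy * genome_size / mean_read_length)

ASSUMED_GENOME_SIZE = 1_700_000  # bp; an assumption, not taken from the text

# Reads required for the two ends of the 5-10 fold range quoted above
low_end = reads_needed(5, 250, ASSUMED_GENOME_SIZE)
high_end = reads_needed(10, 300, ASSUMED_GENOME_SIZE)
```

The estimate ignores failed reads, vector sequence and uneven clone representation, all of which raise the number of reads actually required.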

These purified DNA samples were then sequenced using the multiplex DNA
sequencing based on chemical degradation methods (Church G.M. and Kieffer-Higgins S.,
Science 240:185-188, 1988) or by Sequitherm (Epicenter Technologies) dideoxy
sequencing protocols. The sequencing reactions were electrophoresed and transferred onto
nylon membranes by direct transfer electrophoresis from 40 cm gels (Richterich P. and
Church G.M., Methods in Enzymology 218:187-222, 1993) or by electroblotting (Church,
supra). 24 samples were run per gel. 45 successful membranes were produced by chemical
sequencing and 8 were produced by dideoxy sequencing. The DNA was covalently bound
to the membranes by exposure to ultraviolet light, and hybridized with labeled
oligonucleotides complementary to tag sequences on the vectors (Church, supra). The
membranes were washed to rinse off non-specifically bound probe, and exposed to X-ray
film to visualize individual sequence ladders. After autoradiography, the hybridized probe
was removed by incubation at 65°C, and the hybridization cycle repeated with another tag
sequence until the membrane had been probed 38 times for chemical sequencing
membranes and 10 times for the dideoxy sequencing membranes. Thus, each gel produced
a large number of films, each containing new sequencing information. Whenever a new
blot was processed, it was initially probed for an internal standard sequence added to each
of the pools.
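
The numbers in the preceding paragraph give an upper bound on how many sequence ladders the membranes could yield. The sketch below assumes one membrane per gel, one ladder per sample per probing, and that every probing succeeded; none of these assumptions is stated in the text, and the class name is hypothetical.

```python
from dataclasses import dataclass

@dataclass
class MembraneSet:
    chemistry: str
    membranes: int        # successful membranes
    samples_per_gel: int  # samples run on each gel
    probings: int         # number of times each membrane was probed

    def ladders(self):
        """Upper bound on sequence ladders read from this set of membranes."""
        return self.membranes * self.samples_per_gel * self.probings

sets = [
    MembraneSet("chemical", membranes=45, samples_per_gel=24, probings=38),
    MembraneSet("dideoxy", membranes=8, samples_per_gel=24, probings=10),
]

ladders_by_chemistry = {s.chemistry: s.ladders() for s in sets}
total_ladders = sum(ladders_by_chemistry.values())
```

Because one probing per blot was spent on the internal standard, the number of informative ladders would be somewhat lower than this bound.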

Digital images of the films were generated using a laser-scanning densitometer
(Molecular Dynamics, Sunnyvale, CA). The digitized images were processed on computer
workstations (VaxStation 4000s) using the program REPLICA™ (Church et al.,
Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994).
Image processing included lane straightening, contrast adjustment to smooth out intensity
differences, and resolution enhancement by iterative gaussian deconvolution. The
sequences were then automatically picked in REPLICA™ and displayed for interactive
proofreading before being stored in a project database. The proofreading was accomplished
by a quick visual scan of the film image followed by mouse clicks on the bands of the
displayed image to modify the base calls. Many of the sequence errors could be detected
and corrected because multiple sequence reads covering the same portion of the genomic
DNA provide adequate sequence redundancy for editing. Each sequence automatically
received an identification number (corresponding to microtiter plate, probe information,
and lane set number). This number serves as a permanent identifier of the sequence so it is
always possible to identify the origin of any particular sequence without recourse to a
specialized database.

Routine assembly of H. pylori sequences was done using the program FALCON
(Church et al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.),
Academic Press, 1994). This program has proven to be fast and reliable for most
sequences. The assembled contigs were displayed using a modified version of
GelAssemble, developed by the Genetics Computer Group (GCG) (Devereux et al., Nucleic
Acid Res. 12:387-95, 1984) that interacts with REPLICA™. This provided for an
integrated editor that allows multiple sequence gel images to be instantaneously called up
from the REPLICA™ database and displayed to allow rapid scanning of contigs and
proofreading of gel traces where discrepancies occurred between different sequence reads
in the assembly.

II. Identification, cloning and expression of recombinant H. pylori DNA sequences

To facilitate the cloning, expression and purification of membrane and secreted
proteins from H. pylori, a powerful gene expression system, the pET System (Novagen), for
cloning and expression of recombinant proteins in E. coli, was selected. Also, a DNA
sequence encoding a peptide tag, the His-Tag, was fused to the 3' end of DNA sequences of
interest in order to facilitate purification of the recombinant protein products. The 3' end
was selected for fusion in order to avoid alteration of any 5' terminal signal sequence. The
exception to the above was ppiB, a gene cloned for use as a control in the expression
studies. In this study, the sequence for H. pylori ppiB contains a DNA sequence encoding a
His-Tag fused to the 5' end of the full length gene, because the protein product of this gene
does not contain a signal sequence and is expressed as a cytosolic protein.

PCR Amplification and cloning of DNA sequences containing ORFs for membrane and
secreted proteins from the J99 Strain of Helicobacter pylori.

Sequences chosen (from the list of the DNA sequences of the invention) for cloning
from the J99 strain of H. pylori were prepared for amplification cloning by polymerase
chain reaction (PCR). Synthetic oligonucleotide primers (Table 4) specific for the 5' and 3'
ends of open reading frames (ORFs) were designed and purchased (GibcoBRL Life
Technologies, Gaithersburg, MD, USA). All forward primers (specific for the 5' end of the
sequence) were designed to include an NcoI cloning site at the extreme 5' terminus, except
for HpSeq. 4821082 (SEQ ID NO: 820) where NdeI was used. These primers were
designed to permit initiation of protein translation at a methionine residue followed by a
valine residue and the coding sequence for the remainder of the native H. pylori DNA
sequence. An exception is H. pylori sequence 4821082 (SEQ ID NO: 820) where the
initiator methionine is immediately followed by the remainder of the native H. pylori DNA
sequence. All reverse primers (specific for the 3' end of any H. pylori ORF) included an
EcoRI site at the extreme 5' terminus to permit cloning of each H. pylori sequence into the
reading frame of the pET-28b. The pET-28b vector provides sequence encoding an
additional 20 carboxy-terminal amino acids (only 19 amino acids in HpSeq. 26380318
(SEQ ID NO: 658) and HpSeq. 14640637 (SEQ ID NO: 447)) including six histidine
residues (at the extreme C-terminus), which comprise the His-Tag. An exception to the
above, as noted earlier, is the vector construction for the ppiB gene. A synthetic
oligonucleotide primer specific for the 5' end of ppiB gene encoded a BamHI site at its
extreme 5' terminus and the primer for the 3' end of the ppiB gene encoded a XhoI site at
its extreme 5' terminus.
Table 4
Oligonucleotide primers used for PCR amplification of H. pylori DNA sequences

                             Forward primer 5' to 3'                               Reverse Primer 5' to 3'

Outer membrane Proteins
16225006 (SEQ ID NO: 465)    5'-TATACCATGGTGGGCGCTAA-3' (SEQ ID NO:1897)           5'-ATGAATTCGAGTAAGGA'1T1T1G-3' (SEQ ID NO:1898)
26054702 (SEQ ID NO: 649)    5'-TTAACCATGGTGAAAAGCGATA-3' (SEQ ID NO:1899)         5'-TAGAATTCGCATAACGATCAATC-3' (SEQ ID NO:1900)
7116626 (SEQ ID NO: 865)     5'-ATATCCATGGTGAGTTTGATGA-3' (SEQ ID NO:1901)         5'-ATGAATTCAATTTTTTAT^TGCCA-3' (SEQ ID NO:1902)
29479681 (SEQ ID NO: 677)    5'-AATTCCATGGTGGGGGCTATG-3' (SEQ ID NO:1903)          5'-ATGAATTCTCGATAGCCAAAATC-3' (SEQ ID NO:1904)
14640637 (SEQ ID NO: 447)    5'-AATTCCATGGTGCATAACTTCCATT-3' (SEQ ID NO:1905)      5'-AAGAATTCTCTAGCATCCAAATGGA-3' (SEQ ID NO:1906)

Periplasmic/Secreted Proteins
30100332 (SEQ ID NO: 685)    5'-ATTTCCATGGTCATGTCTCATATT-3' (SEQ ID NO:1907)       5'-ATGAATTCCATCTTTTATTCCAC-3' (SEQ ID NO:1908)
4721061 (SEQ ID NO: 812)     5'-AACCATGGTGATTTTAAGCATTGAAAG-3' (SEQ ID NO:1909)    5'-AAGAATTCCACTCAAAATlTriTAACAO-3' (SEQ ID NO:1910)

Other Surface Proteins
4821082 (SEQ ID NO: 820)     5'-GATCATCCATATGTTATCTTCTAAT-3' (SEQ ID NO:1911)      5'-TGAATTCAACCATTTTAACCCTG-3' (SEQ ID NO:1912)
978477 (SEQ ID NO: 880)      5'-TATACCATGGTGAAATTTTTTCTTTTA-3' (SEQ ID NO:1913)    5'-AGAATTCAATTGCGTCTrGTAAAAG-3' (SEQ ID NO:1914)

Inner Membrane Protein
26380318 (SEQ ID NO: 658)    5'-TATACCATGGTGATGGACAAACTC-3' (SEQ ID NO:1915)       5'-ATGAATTCCCACTTGGGGCGATA-3' (SEQ ID NO:1916)

Cytoplasmic Protein
ppiB                         5'-TTATGGATCCAAACCAATTAAAACT-3' (SEQ ID NO:1917)      5'-TATCTCGAGTTATAGAGAAGGGC-3' (SEQ ID NO:1918)
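
The primer design rules described above (a cloning site near the 5' terminus, and for NcoI primers an initiator ATG followed by a valine codon) can be checked mechanically. The sketch below uses only those primers from the table that are fully legible; the recognition sequences are the standard ones for each enzyme, and the function names are hypothetical.

```python
SITES = {
    "NcoI": "CCATGG",
    "NdeI": "CATATG",
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
}

def site_position(primer, enzyme):
    """0-based position of the enzyme's recognition site in the primer, or -1."""
    return primer.find(SITES[enzyme])

def codon_after_start(primer, enzyme):
    """Codon following the ATG that lies inside an NcoI or NdeI site."""
    site = SITES[enzyme]
    atg = site_position(primer, enzyme) + site.find("ATG")
    return primer[atg + 3:atg + 6]

forward = {  # sequence name -> (enzyme, forward primer 5' to 3')
    "16225006": ("NcoI", "TATACCATGGTGGGCGCTAA"),
    "26054702": ("NcoI", "TTAACCATGGTGAAAAGCGATA"),
    "30100332": ("NcoI", "ATTTCCATGGTCATGTCTCATATT"),
    "4821082": ("NdeI", "GATCATCCATATGTTATCTTCTAAT"),
    "ppiB": ("BamHI", "TTATGGATCCAAACCAATTAAAACT"),
}
reverse = {  # sequence name -> (enzyme, reverse primer 5' to 3')
    "26054702": ("EcoRI", "TAGAATTCGCATAACGATCAATC"),
    "29479681": ("EcoRI", "ATGAATTCTCGATAGCCAAAATC"),
    "4821082": ("EcoRI", "TGAATTCAACCATTTTAACCCTG"),
    "ppiB": ("XhoI", "TATCTCGAGTTATAGAGAAGGGC"),
}

sites_found = all(
    site_position(seq, enzyme) >= 0
    for enzyme, seq in list(forward.values()) + list(reverse.values())
)
second_codons = {
    name: codon_after_start(seq, enzyme)
    for name, (enzyme, seq) in forward.items()
    if enzyme in ("NcoI", "NdeI")
}
```

For the NcoI primers the codon after the ATG begins with GT, which encodes valine for any third base, while the NdeI primer for 4821082 runs straight from the ATG into the following sequence, in line with the exception noted in the text.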
Genomic DNA prepared from the J99 strain of H. pylori (ATCC #55679) was used
as the source of template DNA for PCR amplification reactions (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To amplify a
DNA sequence containing an H. pylori ORF, genomic DNA (50 nanograms) was
introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic
oligonucleotide primers (forward and reverse primers) complementary to and flanking a
defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP,
dCTP, dTTP) and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular
Systems, Inc., Branchburg, NJ, USA) in a final volume of 100 microliters. The following
thermal cycling conditions were used to obtain amplified DNA products for each ORF
using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cycler:

Sequences 26054702 (SEQ ID NO: 649), 7116626 (SEQ ID NO: 865), 29479681 (SEQ
ID NO: 677), 30100332 (SEQ ID NO: 685), 4821082 (SEQ ID NO: 820) and 978477
(SEQ ID NO: 880);
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Sequence 16225006 (SEQ ID NO: 465);
Denaturation at 94°C for 2 min,
25 cycles at 95°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reaction was concluded at 72°C for 6 minutes.

Sequence 4721061 (SEQ ID NO: 812);
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 36°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 60°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Sequence 26380318 (SEQ ID NO: 658);
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 38°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 62°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Sequence 14640637 (SEQ ID NO: 447);
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 33°C for 15 sec and 72°C for 1.5 min
30 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Conditions for amplification of H. pylori ppiB;
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 32°C for 15 sec and 72°C for 1.5 min
25 cycles at 94°C for 15 sec, 56°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.
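
The cycling programs listed above share the same step durations and differ only in cycle numbers and temperatures, so they can be written as data and compared. The following is a minimal sketch; the class and function names are hypothetical, and the computed time covers programmed holds only, since ramp times between temperatures are not given in the text.

```python
from dataclasses import dataclass

@dataclass
class Stage:
    cycles: int
    denature_c: int
    anneal_c: int
    extend_c: int = 72
    step_seconds: tuple = (15, 15, 90)  # denature, anneal, extend

    def seconds(self):
        """Total programmed hold time for all cycles of this stage."""
        return self.cycles * sum(self.step_seconds)

INITIAL_DENATURATION_S = 120  # 94 C for 2 min
FINAL_EXTENSION_S = 360       # 72 C for 6 min

PROGRAMS = {
    "26054702 group": [Stage(2, 94, 30), Stage(23, 94, 55)],
    "16225006": [Stage(25, 95, 55)],
    "4721061": [Stage(2, 94, 36), Stage(23, 94, 60)],
    "26380318": [Stage(2, 94, 38), Stage(23, 94, 62)],
    "14640637": [Stage(2, 94, 33), Stage(30, 94, 55)],
    "ppiB": [Stage(2, 94, 32), Stage(25, 94, 56)],
}

def hold_time_minutes(stages):
    """Programmed hold time of a whole program, in minutes (ramps excluded)."""
    cycling = sum(stage.seconds() for stage in stages)
    return (INITIAL_DENATURATION_S + cycling + FINAL_EXTENSION_S) / 60

hold_times = {name: hold_time_minutes(stages) for name, stages in PROGRAMS.items()}
```

Most programs pair two low-stringency cycles with a longer high-stringency stage; writing them this way makes the differences in annealing temperature and cycle count easy to see at a glance.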

Upon completion of thermal cycling reactions, each sample of amplified DNA was
washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg,
MD, USA). All amplified DNA samples were subjected to digestion with the restriction
endonucleases NcoI and EcoRI (New England BioLabs, Beverly, MA, USA), or in the
case of HpSeq. 4821082 (SEQ ID NO: 820), with NdeI and EcoRI (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). DNA
samples were then subjected to electrophoresis on 1.0 % NuSieve (FMC BioProducts,
Rockland, ME USA) agarose gels. DNA was visualized by exposure to ethidium bromide
and long wave UV irradiation. DNA contained in slices isolated from the agarose gel was
purified using the Bio 101 GeneClean Kit protocol (Bio 101, Vista, CA, USA).

Cloning of H. pylori DNA sequences into the pET-28b prokaryotic expression vector.

The pET-28b vector was prepared for cloning by digestion with NcoI and EcoRI, or
in the case of H. pylori sequence 4821082 (SEQ ID NO: 820) with NdeI and EcoRI
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds.,
1994). In the case of cloning ppiB, the pET-28a vector, which encodes a His-Tag that can
be fused to the 5' end of an inserted gene, was used and the cloning site prepared for
cloning with the ppiB gene by digestion with BamHI and XhoI restriction endonucleases.

Following digestion, DNA inserts were cloned (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously
digested pET-28b expression vector, except for the amplified insert for ppiB, which was
cloned into the pET-28a expression vector. Products of the ligation reaction were then used
to transform the BL21 strain of E. coli (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) as described below.



Transformation of competent bacteria with recombinant plasmids 

Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), were
transformed with recombinant pET expression plasmids carrying the cloned H. pylori
sequences according to standard methods (Current Protocols in Molecular Biology, John Wiley and
Sons, Inc., F. Ausubel et al., eds., 1994). Briefly, 1 microliter of ligation reaction was
mixed with 50 microliters of electrocompetent cells and subjected to a high voltage pulse,
after which samples were incubated in 0.45 milliliters SOC medium (0.5% yeast extract,
2.0 % tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM
glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB agar plates
containing 25 microgram/ml kanamycin sulfate for growth overnight. Transformed
colonies of BL21 were then picked and analyzed to evaluate cloned inserts as described
below.

Identification of recombinant pET expression plasmids carrying H. pylori sequences

Individual BL21 clones transformed with recombinant pET-28b-H. pylori ORFs
were analyzed by PCR amplification of the cloned inserts using the same forward and
reverse primers, specific for each H. pylori sequence, that were used in the original PCR
amplification cloning reactions. Successful amplification verified the integration of the H.
pylori sequences in the expression vector (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).

Isolation and Preparation of plasmid DNA from BL21 transformants 

Individual clones of recombinant pET-28b vectors carrying properly cloned H.
pylori ORFs were picked and incubated in 5 ml of LB broth plus 25 microgram/ml
kanamycin sulfate overnight. The following day plasmid DNA was isolated and purified
using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA).

Expression of recombinant H. pylori sequences in E. coli

The pET vector can be propagated in any E. coli K-12 strain, e.g. HMS174, HB101,
JM109, DH5, etc., for the purpose of cloning or plasmid preparation. Hosts for expression
include E. coli strains containing a chromosomal copy of the gene for T7 RNA polymerase.
These hosts are lysogens of bacteriophage DE3, a lambda derivative that carries the lacI
gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA polymerase is
induced by addition of isopropyl-β-D-thiogalactoside (IPTG), and the T7 RNA polymerase
transcribes any target plasmid, such as pET-28b, carrying a T7 promoter and a gene of
interest. Strains used include: BL21(DE3) (Studier, F.W., Rosenberg, A.H., Dunn, J.J., and
Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89).

To express recombinant H. pylori sequences, 50 nanograms of plasmid DNA
isolated as described above was used to transform competent BL21(DE3) bacteria
(provided by Novagen as part of the pET expression system kit) as described above. The lacZ
gene (beta-galactosidase) was expressed in the pET-System as described for the H. pylori
recombinant constructions. Transformed cells were cultured in SOC medium for 1 hour,
and the culture was then plated on LB plates containing 25 micrograms/ml kanamycin
sulfate. The following day, bacterial colonies were pooled and grown in LB medium
containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 nm of 0.5 to
1.0 O.D. units, at which point 1 millimolar IPTG was added to the culture for 3 hours to
induce gene expression of the H. pylori recombinant DNA constructions.

After induction of gene expression with IPTG, bacteria were pelleted by
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets
were resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1
mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 min at 4°C. Wet
pellets were weighed and frozen at -80°C until ready for protein purification.

III. Purification of recombinant proteins from E. coli

Analytical Methods

The concentrations of purified protein preparations were quantified
spectrophotometrically using absorbance coefficients calculated from amino acid content
(Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations were also
measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, and
Lowry, O.H., Rosebrough, N., Farr, A.L. & Randall, R.J. (1951) J. Biol. Chem. 193, pages
265-275, using bovine serum albumin as a standard.
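
The idea of deriving an absorbance coefficient from amino acid content and then converting a reading at 280 nm into a concentration can be sketched as follows. The per-residue values used here (5500 for tryptophan, 1490 for tyrosine and 125 for each disulfide, in M-1 cm-1) are commonly used approximations and are an assumption of this sketch; they are not necessarily the coefficients of the Perkins reference, and the example sequence is a made-up placeholder.

```python
# Approximate molar absorbance contributions at 280 nm (M^-1 cm^-1).
# Assumed values for illustration, not taken from the cited reference.
EPSILON_TRP = 5500
EPSILON_TYR = 1490
EPSILON_CYSTINE = 125

def molar_extinction_280(sequence, n_disulfides=0):
    """Estimate the molar absorbance coefficient from a one-letter sequence."""
    return (EPSILON_TRP * sequence.count("W")
            + EPSILON_TYR * sequence.count("Y")
            + EPSILON_CYSTINE * n_disulfides)

def concentration_molar(a280, epsilon, path_cm=1.0):
    """Beer-Lambert law: concentration = absorbance / (epsilon * path length)."""
    return a280 / (epsilon * path_cm)

# Placeholder peptide (not a sequence of the invention)
example_sequence = "MWYYK"
epsilon = molar_extinction_280(example_sequence)
conc = concentration_molar(0.848, epsilon)
```

A protein without tryptophan or tyrosine gives a coefficient of zero by this estimate, which is one reason a dye-binding or Lowry-type assay against a bovine serum albumin standard is useful as a cross-check.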

SDS-polyacrylamide gels (12% or 4.0 to 25 % acrylamide gradient gels) were
purchased from BioRad (Hercules, CA, USA), and stained with Coomassie blue.
Molecular weight markers included rabbit skeletal muscle myosin (200 kDa), E. coli β-
galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum albumin
(66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (31 kDa), soybean trypsin
inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5 kDa).

1. Purification of soluble proteins

All steps were carried out at 4°C. Frozen cells were thawed, resuspended in 5
volumes of lysis buffer (20 mM Tris, pH 7.9, 0.5 M NaCl, 5 mM imidazole with 10%
glycerol, 0.1 % 2-mercaptoethanol, 200 µg/ml lysozyme, 1 mM phenylmethylsulfonyl
fluoride (PMSF), and 10 µg/ml each of leupeptin, aprotinin, pepstatin, L-1-chloro-3-[4-
tosylamido]-7-amino-2-heptanone (TLCK), L-1-chloro-3-[4-tosylamido]-4-phenyl-2-
butanone (TPCK), and soybean trypsin inhibitor), and ruptured by several passages through
a small volume microfluidizer (Model M-110S, Microfluidics International Corporation,
Newton, MA). The resultant homogenate was made 0.1 % Brij 35, and centrifuged at
100,000 x g for 1 hour to yield a clear supernatant (crude extract).

Following filtration through a 0.8 µm Supor filter (Gelman Sciences, FRG) the
crude extract was loaded directly onto a Ni2+-nitrilotriacetate-agarose (NTA) column with a 5
milliliter bed volume (Hochuli, E., Döbeli, H., and Schacher, A. (1987) J. Chromatography
411, 177-184) pre-equilibrated in lysis buffer containing 10 % glycerol, 0.1 % Brij 35 and 1
mM PMSF. The column was washed with 250 ml (50 bed volumes) of lysis buffer
containing 10 % glycerol, 0.1 % Brij 35, and was eluted with sequential steps of lysis
buffer containing 10 % glycerol, 0.05 % Brij 35, 1 mM PMSF, and 20, 100, 200, and 500
mM imidazole in succession. Fractions were monitored by absorbance at 280 nm (OD280), and
peak fractions were analyzed by SDS-PAGE. Fractions containing the recombinant protein
eluted at 100 mM imidazole.

Recombinant protein 14640637 (SEQ ID NO: 447) and proteins beta-galactosidase (lacZ)
and peptidyl-prolyl cis-trans isomerase (ppiB)

Fractions containing the recombinant proteins from the Ni2+-NTA-agarose columns
were pooled and then concentrated to approximately 5 ml by centrifugal filtration
(Centriprep-10, Amicon, MA), and loaded directly onto a 180-ml column (1.6 x 91 cm) of
Sephacryl S-100 HR gel filtration medium equilibrated in Buffer A (10 mM Hepes, pH 7.5,
150 mM NaCl, 0.1 mM EGTA) and run in Buffer A at 18 ml/h. Fractions containing the
recombinant protein were identified by absorbance at 280 nm and analyzed by SDS-PAGE.
Fractions were pooled and concentrated by centrifugal filtration.

Recombinant protein 7116626 (SEQ ID NO: 865) 

Fractions containing the recombinant protein from the Ni2+-NTA-agarose column
were pooled and dialyzed overnight against 1 liter of dialysis buffer (10 mM MOPS, pH
6.5, 50 mM NaCl, 0.1 mM EGTA, 0.02% Brij 35 and 1 mM PMSF). In the morning, a fine
white precipitate was removed by centrifugation and the resulting supernatant was loaded
onto an 8 ml (8 x 75 mm) MonoS high performance liquid chromatography column
(Pharmacia Biotechnology, Inc., Piscataway, NJ, USA) equilibrated in buffer B (10 mM
MOPS, pH 6.5, 0.1 mM EGTA) containing 50 mM NaCl. The column was washed with 10
bed volumes of buffer B containing 50 mM NaCl, and developed with a 50-ml linear
gradient of increasing NaCl (50 to 500 mM). Recombinant protein 7116626 (SEQ ID NO:
865) eluted as a sharp peak at 300 mM NaCl.
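
For an ideal linear gradient, the volume at which a given salt concentration is reached follows from simple proportion. The sketch below applies this to the 50 to 500 mM gradient described above; it assumes a perfectly linear gradient with no delay volume between the mixer and the column, and the function name is hypothetical.

```python
def gradient_volume_at(conc_mM, start_mM, end_mM, gradient_volume_ml):
    """Volume into an ideal linear gradient at which conc_mM is reached."""
    if not min(start_mM, end_mM) <= conc_mM <= max(start_mM, end_mM):
        raise ValueError("concentration lies outside the gradient")
    return gradient_volume_ml * (conc_mM - start_mM) / (end_mM - start_mM)

BED_VOLUME_ML = 8.0  # column volume quoted in the text

# Point in the 50-ml gradient where the nominal NaCl concentration is 300 mM
elution_ml = gradient_volume_at(300, 50, 500, 50)
elution_bed_volumes = elution_ml / BED_VOLUME_ML
```

On a real system the observed elution position is shifted by the delay volume of the tubing and column, so this figure is a nominal position within the gradient rather than a predicted fraction number.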


2. Purification of insoluble proteins from inclusion bodies 

The following steps were carried out at 4°C. Cell pellets were resuspended in lysis
buffer with 10% glycerol, 200 µg/ml lysozyme, 5 mM EDTA, 1 mM PMSF and 0.1 %
2-mercaptoethanol. After passage through the cell disrupter, the resulting homogenate was
made 0.2 % deoxycholate, stirred 10 minutes, then centrifuged at 20,000 x g for 30 min.
The pellets were washed with lysis buffer containing 10 % glycerol, 10 mM EDTA, 1%
Triton X-100, 1 mM PMSF and 0.1% 2-mercaptoethanol, followed by several washes with
lysis buffer containing 1 M urea, 1 mM PMSF and 0.1 % 2-mercaptoethanol. The resulting
white pellet was composed primarily of inclusion bodies, free of unbroken cells and
membranous materials.

Recombinant proteins 26054702 (SEQ ID NO: 649), 16225006 (SEQ ID NO: 465),
30100332 (SEQ ID NO: 685), 4721061 (SEQ ID NO: 812)

The following steps were carried out at room temperature. Purified inclusion bodies
were dissolved in 20 ml 8.0 M urea in lysis buffer with 1 mM PMSF and 0.1 %
2-mercaptoethanol, and incubated at room temperature for 1 hour. Materials that did not
dissolve were removed by centrifugation. The clear supernatant was filtered, then loaded
onto a Ni2+-NTA agarose column pre-equilibrated in 8.0 M urea in lysis buffer. The
column was washed with 250 ml (50 bed volumes) of lysis buffer containing 8 M urea, 1.0
mM PMSF and 0.1 % 2-mercaptoethanol, and developed with sequential steps of lysis
buffer containing 8 M urea, 1 mM PMSF, 0.1 % 2-mercaptoethanol and 20, 100, 200, and
500 mM imidazole in succession. Fractions were monitored by absorbance at 280 nm (OD280),
and peak fractions were analyzed by SDS-PAGE. Fractions containing the recombinant
protein eluted at 100 mM imidazole.

Recombinant proteins 29479681 (SEQ ID NO: 677), 978477 (SEQ ID NO: 880), 26380318 
(SEQ ID NO: 658) 

The pellet containing the inclusion bodies was solubilized in buffer B containing 8
M urea, 1 mM PMSF and 0.1 % 2-mercaptoethanol, and incubated for 1 hour at room
temperature. Insoluble materials were removed by centrifugation at 20,000 x g for 30 min,
and the cleared supernatant was loaded onto a 15 ml (1.6 x 7.5 cm) SP-Sepharose column
pre-equilibrated in buffer B, 6 M urea, 1 mM PMSF, 0.1 % 2-mercaptoethanol. After
washing the column with 10 bed volumes, the column was developed with a linear gradient
from 0 to 500 mM NaCl.

Dialysis and concentration of protein samples 

Urea was removed slowly from the protein samples by dialysis against Tris-buffered
saline (TBS; 10 mM Tris pH 8.0, 150 mM NaCl) containing 0.5 % deoxycholate
(DOC) with sequential reduction in urea concentration as follows: 6 M, 4 M, 3 M, 2 M, 1 M,
0.5 M and finally TBS without any urea. Each dialysis step was conducted for a minimum
of 4 hours at room temperature.

After dialysis, samples were concentrated by pressure filtration using Amicon
stirred-cells. Protein concentrations were measured using the methods of Perkins (1986
Eur. J. Biochem. 157, 169-180), Bradford ((1976) Anal. Biochem. 72, 248-254) and Lowry
((1951) J. Biol. Chem. 193, pages 265-275).
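
The stepwise urea removal described above can be laid out as a simple schedule. The sketch below assumes that each listed concentration, including the final urea-free buffer, counts as one dialysis step of exactly the minimum duration; actual steps may have been longer, and the function name is hypothetical.

```python
UREA_STEPS_M = [6, 4, 3, 2, 1, 0.5, 0]  # urea concentration of each dialysis buffer
MIN_HOURS_PER_STEP = 4                  # minimum duration quoted in the text

def dialysis_schedule(steps, hours_per_step):
    """Return [(start_hour, urea_molar), ...] and the total minimum duration."""
    rows = []
    elapsed = 0
    for urea in steps:
        rows.append((elapsed, urea))
        elapsed += hours_per_step
    return rows, elapsed

schedule, total_hours = dialysis_schedule(UREA_STEPS_M, MIN_HOURS_PER_STEP)
```

Under these assumptions the complete series takes at least 28 hours, which is why the procedure is described as slow removal of urea.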

The recombinant proteins purified by the methods described above are summarized 
in Table 5 below. 
IV. Analysis of H. pylori proteins as Vaccine candidates

To investigate the immunomodulatory effect of H. pylori proteins, a mouse/H.
pylori model was used. This model mimics the human H. pylori infection in many
respects. The focus is on the effect of oral immunization in H. pylori infected animals in
order to test the concept of therapeutic oral immunotherapy.

Animals 

Female SPF BALB/c mice were purchased from Bomholt Breeding center
(Denmark). They were kept in ordinary Makrolon cages with free supply of water and food.
The animals were 4-6 weeks old at arrival.

Infection 

After a minimum of one week of acclimatization, the animals were infected with a
type 2 strain (VacA negative) of H. pylori (strain 244, originally isolated from an ulcer
patient). In our hands, this strain has earlier proven to be a good colonizer of the mouse
stomach. The bacteria were grown overnight in Brucella broth supplemented with 10 %
fetal calf serum, at 37°C in a microaerophilic atmosphere (10% CO2, 5% O2). The animals
were given an oral dose of omeprazole (400 µmol/kg) and 3-5 h after this an oral
inoculation of H. pylori in broth (approximately lO'crWanimal). Positive take of the
infection was checked in some animals 2-3 weeks after the inoculation.

Antigens

Recombinant H. pylori antigens were chosen based on their association with
externally exposed H. pylori cell membrane. These antigens were selected from the
following groups: (1.) Outer Membrane Proteins; (2.) Periplasmic/Secreted proteins; (3.)
Outer Surface proteins; and (4.) Inner Membrane proteins. All recombinant proteins were
constructed with a hexa-HIS tag for purification reasons and the non-Helicobacter pylori
control protein (β-galactosidase from E. coli; LacZ) was constructed in the same way.

All antigens were given in a soluble form, i.e. dissolved in either a HEPES buffer or
in a buffer containing 0.5% Deoxycholate (DOC).

The antigens are listed in Table 6 below. 



Table 6
Helicobacter pylori proteins

Outer membrane Proteins
SEQ ID NO: 447
SEQ ID NO: 677
SEQ ID NO: 865
SEQ ID NO: 812
SEQ ID NO: 465

Periplasmic/Secreted proteins
SEQ ID NO: 685

Other cell envelope proteins
SEQ ID NO: 820
SEQ ID NO: 880

Flagella-associated proteins
SEQ ID NO: 658

Control proteins
β-galactosidase (LacZ)

Immunizations

Ten animals in each group were immunized 4 times over a 34 day period (day 1, 15,
25 and 35). Purified antigens in solution or suspension were given at a dose of 100
µg/mouse. As an adjuvant, the animals were also given 10 µg/mouse of Cholera toxin (CT)
with each immunization. Omeprazole (400 µmol/kg) was given orally to the animals 3-5 h
prior to immunization as a way of protecting the antigens from acid degradation. Infected
control animals received HEPES buffer + CT or DOC buffer + CT. Animals were
sacrificed 2-4 weeks after final immunization. A general outline of the study is shown in
Table 7 below.

Table 7
Study outline, therapeutic immunization:
Mice were all infected with H. pylori strain Ah244 at day 30. Proteins are listed by their
SEQ ID #s.

Substance                                 Mouse strain (n=10)   Dose/mouse   Dates for dosing
1. Controls, PBS                          Balb/c                0.3 ml       0, 14, 24, 34
2. Cholera toxin, 10 µg                   Balb/c                0.3 ml       0, 14, 24, 34
3. Protein 447, 100 µg + CT 10 µg         Balb/c                0.3 ml       0, 14, 24, 34
4. Protein 465, 100 µg + CT 10 µg         Balb/c                0.3 ml       0, 14, 24, 34
5. Protein 649, 100 µg + CT 10 µg         Balb/c                0.3 ml       0, 14, 24, 34
6. Protein 658, 100 µg + CT 10 µg         Balb/c                0.3 ml       0, 14, 24, 34
7. Protein 677, 100 µg + CT 10 µg         Balb/c                0.3 ml       0, 14, 24, 34
8. Protein 685, 100 µg + CT 10 µg         Balb/c                0.3 ml       0, 14, 24, 34
9. Protein 812, 100 µg + CT 10 µg         Balb/c                0.3 ml       0, 14, 24, 34
10. Protein 820, 100 µg + CT 10 µg        Balb/c                0.3 ml       0, 14, 24, 34
11. Protein 880, 100 µg + CT 10 µg        Balb/c                0.3 ml       0, 14, 24, 34
12. Protein 865, 100 µg + CT 10 µg        Balb/c                0.3 ml       0, 14, 24, 34

Analysis of infection

Mucosal infection: The mice were sacrificed by CO2 and cervical dislocation. The
abdomen was opened and the stomach removed. After cutting the stomach along the
greater curvature, it was rinsed in saline. The mucosa from the antrum and corpus of an
area of 25 mm2 was scraped separately with a surgical scalpel. The mucosa scraping was
suspended in Brucella broth and plated onto Blood Skirrow selective plates. The plates
were incubated under microaerophilic conditions for 3-5 days and the number of colonies
was counted. The identity of H. pylori was ascertained by urease and catalase test and by
direct microscopy or Gram staining.

The urease test was performed essentially as follows. The reagent, Urea Agar Base
Concentrate, was purchased from DIFCO Laboratories, Detroit, MI (Catalog # 0284-61-3).
Urea agar base concentrate was diluted 1:10 with water. 1 ml of the diluted concentrate
was mixed with 100-200 µl of actively growing H. pylori cells. Color change to magenta
indicated that cells were urease positive.

The catalase test was performed essentially as follows. The reagent, N,N,N',N'-
Tetramethyl-p-Phenylenediamine, was purchased from Sigma, St. Louis, MO (Catalog #
T3134). A solution of the reagent (1% w/v in water) was prepared. H. pylori cells were
swabbed onto Whatman filter paper and overlaid with the 1% solution. Color change to
dark blue indicated that the cells were catalase positive.

Serum antibodies: Serum was prepared from all mice from blood drawn by heart
puncture. Serum antibodies were identified by regular ELISA techniques, where the
specific antigens of Helicobacter pylori were plated.

Mucosal antibodies: Gentle scrapings of a defined part of the corpus and of 4 cm of
duodenum were performed in 50% of the mice in order to detect the presence of antibodies
in the mucus. The antibody titers were determined by regular ELISA technique as for
serum antibodies.

Statistical analysis: The Wilcoxon-Mann-Whitney sign rank test was used for
determination of significant effects of the antigens on Helicobacter pylori colonization.
P<0.05 was considered significant. Because the antrum is the major colonization site for
Helicobacter, most emphasis was put upon changes in the antral colonization.
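
The group comparison described under Statistical analysis can be illustrated with a minimal sketch. It assumes the test named above is the two-sample Wilcoxon-Mann-Whitney rank-sum comparison of an immunized group against a control group, and it uses a normal approximation with tie correction and no continuity correction; neither detail is stated in the text. The function names are illustrative only, and any colony counts passed to them would be supplied by the reader.

```python
import math


def midranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return ranks


def mann_whitney_u(group_a, group_b):
    """Return (U, two-sided p) using the normal approximation (assumption)."""
    n1, n2 = len(group_a), len(group_b)
    if n1 == 0 or n2 == 0:
        raise ValueError("both groups must contain at least one value")
    pooled = list(group_a) + list(group_b)
    ranks = midranks(pooled)
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2
    u = min(u1, n1 * n2 - u1)
    n = n1 + n2
    # Tie correction: sum of t^3 - t over each group of tied values.
    counts = {}
    for value in pooled:
        counts[value] = counts.get(value, 0) + 1
    tie_term = sum(t ** 3 - t for t in counts.values())
    sigma = math.sqrt(n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1))))
    if sigma == 0:
        return u, 1.0
    z = (u - n1 * n2 / 2) / sigma
    return u, min(1.0, math.erfc(abs(z) / math.sqrt(2)))


def is_significant(group_a, group_b, alpha=0.05):
    """Apply the P<0.05 criterion stated in the text."""
    return mann_whitney_u(group_a, group_b)[1] < alpha
```

For groups of 8 to 10 animals an exact test may be preferable to the normal approximation; the sketch only shows how the ranks and the P<0.05 threshold fit together.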

Results 

Antibodies in sera: All antigens tested, given together with CT, gave rise to a
measurable specific titer in serum. The highest responses were seen with SEQ ID
NOs:865, 812, 658, 447, and 820 (see Figure 1).

Antibodies in mucus: In the mucus scrapings, specific antibodies against all
antigens tested were seen. By far the strongest response was seen with SEQ ID NO:685,
followed by 447, 865, and 658 (see Figure 2).

Therapeutic immunization effects:

All control animals (BALB/c mice) were well colonized with H. pylori (strain
AH244) in both antrum and corpus of the stomach. Of the antigens tested, 3 proteins (SEQ
ID NOs: 812, 820, and 447) gave a good and significant reduction and/or eradication of the
H. pylori infection. The degree of colonization of the antrum was lower following
immunization with SEQ ID NOs: 880, 658, and 865 compared to control. The effect of
SEQ ID NOs:465, 677, and 685 did not differ from control. The control protein lacZ, i.e.
the non-H. pylori protein, had no eradication effect and in fact had higher Helicobacter
colonization compared to the HEPES + CT control. All data are shown in Figures 3 and 4
for proteins dissolved in HEPES and DOC respectively. Data are shown as geometric mean
values. n=8-10; Wilcoxon-Mann-Whitney sign rank test; * = p<0.05; x/10 = number of mice
showing eradication of H. pylori over the total number of mice examined.
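
Since the colonization data are reported as geometric means, a short sketch of that calculation may be helpful. The text does not say how animals with zero colonies (eradication) entered the mean; the optional `offset` parameter below is an assumption introduced only so that zero counts can be handled, and the function name is illustrative.

```python
import math


def geometric_mean(counts, offset=0.0):
    """Geometric mean of colony counts.

    offset is a hypothetical shift (for example 1.0) added before taking
    logarithms and subtracted afterwards so that zero counts are usable;
    the source does not state how zeros were treated.
    """
    if not counts:
        raise ValueError("at least one count is required")
    shifted = [c + offset for c in counts]
    if any(value <= 0 for value in shifted):
        raise ValueError("all shifted counts must be positive")
    mean_log = sum(math.log(value) for value in shifted) / len(shifted)
    return math.exp(mean_log) - offset
```

The geometric mean damps the influence of a few heavily colonized animals, which is the usual reason for preferring it to the arithmetic mean for colony counts.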

The data presented indicate that all of the H. pylori associated proteins included in
this study, when used as oral immunogens in conjunction with the oral adjuvant CT,
resulted in stimulation of an immune response as measured by specific serum and mucosal
antibodies. A majority of the proteins led to a reduction, and in some cases complete
clearance, of the colonization of H. pylori in this animal model. It should be noted that the
reduction or clearance was due to heterologous protection rather than homologous
protection (the polypeptides were based on the H. pylori J99 strain sequence and used in the
therapeutic immunization studies against a different (AH244) challenge strain), indicating
the vaccine potential against a wide variety of H. pylori strains.

The highest colonization in the antrum was seen in animals treated with the non-
Helicobacter protein LacZ, indicating that the effects seen with the Helicobacter pylori
antigens were specific.

Taken together, these data strongly support the use of these H. pylori proteins in a
pharmaceutical formulation for use in humans to treat and/or prevent H. pylori
infections.

V. Sequence Variance Analysis of genes in Helicobacter pylori strains

Four genes were cloned and sequenced from several strains of H. pylori to compare
the DNA and deduced amino acid sequences. This information was used to determine the
sequence variation between the H. pylori strain, J99, and other H. pylori strains isolated
from human patients.

Preparation of Chromosomal DNA.

Cultures of H. pylori strains (as listed in Table 10) were grown in BLBB (1%
Tryptone, 1% Peptamin, 0.1% Glucose, 0.2% Yeast Extract, 0.5% Sodium Chloride, 5%
Fetal Bovine Serum) to an OD600 of 0.2. Cells were centrifuged in a Sorvall RC-3B at
3500 x g at 4°C for 15 minutes and the pellet resuspended in 0.95 ml of 10 mM Tris-HCl,
0.1 mM EDTA (TE). Lysozyme was added to a final concentration of 1 mg/ml along with
SDS to 1% and RNAse A + T1 to 0.5 mg/ml and 5 units/ml respectively, and incubated at
37°C for one hour. Proteinase K was then added to a final concentration of 0.4 mg/ml and
the sample was incubated at 55°C for more than one hour. NaCl was added to the sample
to a concentration of 0.65 M, mixed carefully, and 0.15 ml of 10% CTAB in 0.7 M NaCl
(final is 1% CTAB/70 mM NaCl) was added followed by incubation at 65°C for 20
minutes. At this point, the samples were extracted with chloroform:isoamyl alcohol,
extracted with phenol, and extracted again with chloroform:isoamyl alcohol. DNA was
precipitated with either EtOH (1.5 x volumes) or isopropanol (0.6 x volumes) at -70°C for
10 minutes, washed in 70% EtOH and resuspended in TE.

PCR Amplification and cloning.

Genomic DNA prepared from twelve strains of Helicobacter pylori was used as the
source of template DNA for PCR amplification reactions (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To amplify a DNA
sequence containing an H. pylori ORF, genomic DNA (10 nanograms) was introduced into
a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic oligonucleotide primers
(forward and reverse primers, see Table 8) complementary to and flanking a defined H.
pylori ORF, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP) and
0.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc.,
Branchburg, NJ, USA) in a final volume of 20 microliters in duplicate reactions.

Table 8

Oligonucleotide primers used for PCR amplification of H. pylori DNA sequences.

Outer membrane Proteins, with forward primer (5' to 3') and reverse primer (5' to 3'):

SEQ ID NO:649 (for strains AH4, AH15, AH61, 5294, 5640, AH18, and AH244)
    Forward: 5'-TTAACCATGGTGAAAAGCGATA-3' (SEQ ID NO:1919)
    Reverse: 5'-TAGAATTCGCCTCTAAAACTTTAG-3' (SEQ ID NO:1920)

SEQ ID NO:649 (for strains AH5, 5155, 7958, AH24, and J99)
    Forward: 5'-TTAACCATGGTGAAAAGCGATA-3' (SEQ ID NO:1921)
    Reverse: 5'-TAGAATTCGCATAACGATCAATC-3' (SEQ ID NO:1922)

SEQ ID NO:865
    Forward: 5'-ATATCCATGGTGAGTTTGATGA-3' (SEQ ID NO:1923)
    Reverse: 5'-ATGAA'lTCAA'rnTlTA'lTl 1' GCCA-3' (SEQ ID NO:1924)

SEQ ID NO:677
    Forward: 5'-AATTCCATGGCTATCCAAATCCG-3' (SEQ ID NO:1925)
    Reverse: 5'-ATGAATTCGCCAAAATCGTAGTATT-3' (SEQ ID NO:1926)

SEQ ID NO:764
    Forward: 5'-GATACCATGGAATTTATGAAAAAG-3' (SEQ ID NO:1927)
    Reverse: 5'-TGAATTCGAAAAAGTGTAGTTATAC-3' (SEQ ID NO:1928)


WO 96/40893 



PCT/US96/09122




The following thermal cycling conditions were used to obtain amplified DNA
products for each ORF using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal
cycler:

Sequences (by SEQ ID NO:) 865 and 764:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Sequence (by SEQ ID NO:) 649 for strains AH5, 5155, 7958, AH24, and J99:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
25 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reaction was concluded at 72°C for 6 minutes.

Sequences (by SEQ ID NO:) 677 and 649 for strains AH4, AH15, AH61, 5294, 5640,
AH18, and Hp244:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 20 sec and 72°C for 2 min
25 cycles at 94°C for 15 sec, 55°C for 20 sec and 72°C for 2 min
Reactions were concluded at 72°C for 8 minutes.
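
The three cycling programs above share one shape (initial denaturation, two low-stringency cycles, a block of main cycles, final extension), which the following sketch encodes as data. The class names, the helper `make_program`, and the dictionary keys are illustrative labels, not part of the original protocol. The computed totals cover programmed hold times only and ignore ramping between temperatures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int
    seconds: int


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple


def make_program(anneal_s, extend_s, main_cycles, final_s):
    """Build one program from the four values that differ between groups."""
    return (
        Stage(1, (Step(94, 120),)),                      # denaturation, 2 min
        Stage(2, (Step(94, 15), Step(30, anneal_s), Step(72, extend_s))),
        Stage(main_cycles, (Step(94, 15), Step(55, anneal_s), Step(72, extend_s))),
        Stage(1, (Step(72, final_s),)),                  # final extension
    )


def total_hold_seconds(program):
    """Sum of programmed hold times (ramp times are not included)."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


# Keys are illustrative labels for the three groups listed in the text.
PROGRAMS = {
    "865_and_764": make_program(15, 90, 23, 360),
    "649_AH5_group": make_program(15, 90, 25, 360),
    "677_and_649_AH4_group": make_program(20, 120, 25, 480),
}
```

Writing the programs this way makes the differences between groups easy to see: only the annealing time, extension time, number of main cycles, and final extension change.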

Upon completion of thermal cycling reactions, each pair of samples was combined
and used directly for cloning into the pCR cloning vector as described below.

Cloning of H. pylori DNA sequences into the pCR TA cloning vector.

All amplified inserts were cloned into the pCR 2.1 (pCRII in the case of H. pylori
sequence 865) vector by the method described in the Original TA cloning kit (Invitrogen,
San Diego, CA). Products of the ligation reaction were then used to transform the
TOP10F' (INVαF' in the case of H. pylori sequence 865) strain of E. coli as described
below.

Transformation of competent bacteria with recombinant plasmids

Competent bacteria, E. coli strain TOP10F' or E. coli strain INVαF', were
transformed with recombinant pCR expression plasmids carrying the cloned H. pylori
sequences according to standard methods (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). Briefly, 2 microliters of 0.5
micromolar BME was added to each vial of 50 microliters of competent cells.
Subsequently, 2 microliters of ligation reaction was mixed with the competent cells and
incubated on ice for 30 minutes. The cells and ligation mixture were then subjected to a
"heat shock" at 42°C for 30 seconds, and were subsequently placed on ice for an additional
2 minutes, after which samples were incubated in 0.45 milliliters SOC medium (0.5%
yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4
and 20 mM glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB
agar plates containing 25 micrograms/ml kanamycin sulfate or 100 micrograms/ml
ampicillin for growth overnight. Transformed colonies of TOP10F' or INVαF' were then
picked and analyzed to evaluate cloned inserts as described below.

Identification of recombinant PCR plasmids carrying H. pylori sequences

Individual TOP10F' or INVαF' clones transformed with recombinant pCR-H. pylori
ORFs were analyzed by PCR amplification of the cloned inserts using the same forward
and reverse primers, specific for each H. pylori sequence, that were used in the original
PCR amplification cloning reactions. Successful amplification verified the integration of
the H. pylori sequences in the cloning vector (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994).

Individual clones of recombinant pCR vectors carrying properly cloned H. pylori
ORFs were picked for sequence analysis. Sequence analysis was performed on ABI
Sequencers using standard protocols (Perkin Elmer) using vector-specific primers (as found
in pCRII or pCR2.1, Invitrogen, San Diego, CA) and sequencing primers specific to the
ORF as listed in Table 9 below.

Table 9



Oligonucleotide primers used for sequencing of H. pylori DNA sequences

Outer membrane Proteins

SEQ ID NO:649
    5'-CCCTTCATTTTAGAAATCG-3' (SEQ ID NO:1929)
    5'-AnTCAACCAATTCAATGCG-3' (SEQ ID NO:1930)
    5'-GCCCCTTTTGATTTGAAGCT-3' (SEQ ID NO:1931)
    5'-TCGCTCCAAGATACCAAGAAGT-3' (SEQ ID NO:1932)
    5'-CTTGAATTAGGGGCAAAGATCG-3' (SEQ ID NO:1933)
    5'-ATGCGTTnTACCCAAAGAAGT-3' (SEQ ID NO:1934)
    5'-ATAACGCCACTTCCTTATTGGT-3' (SEQ ID NO:1935)
    5'-Cl ITGGGTAAAAACGCATC-3' (SEQ ID NO:1936)
    5'-CGATCTTTGATCCTAATTCA-3' (SEQ ID NO:1937)
    5'-ATCAAGTTGCCTATGCTGA-3' (SEQ ID NO:1938)

SEQ ID NO:865
    5'-TTGAACACTTTTGATTATGCGG-3' (SEQ ID NO:1939)
    5'-GGATTATGCGATTGTTTTACAAG-3' (SEQ ID NO:1940)
    5'-GTCTTTAGCAAAAATGGCGTC-3' (SEQ ID NO:1941)
    5'-AATGAGCGTAAGAGAGCCTTC-3' (SEQ ID NO:1942)

SEQ ID NO:677
    5'-CTTATGGGGGTATTGTCA-3' (SEQ ID NO:1943)
    5'-AGCATGTGGGTATCCAGC-3' (SEQ ID NO:1944)
    5'-AGGTrGTrGCGTAAAGACT-3' (SEQ ID NO:1945)
    5'-CTGCCTCCACCTTTGATC-3' (SEQ ID NO:1946)

SEQ ID NO:764
    5'-ACCAATATCAATTGGCACT-3' (SEQ ID NO:1947)
    5'-ACTrGGAAAAGCTCTGCA-3' (SEQ ID NO:1948)
    5'-CTTGCTTGTCATATCTAGC-3' (SEQ ID NO:1949)
    5'-GTTGAAGTGTTGGTGCTA-3' (SEQ ID NO:1950)
    5'-CAAGCAAGTGGTTTGGTTTTAG-3' (SEQ ID NO:1951)
    5'-TGGAAAGAGCAAATCATTGAAG-3' (SEQ ID NO:1952)
    5'-GCCCATAATCAAAAAGCCCAT-3' (SEQ ID NO:1953)
    5'-CTAAAACCAAACCACTTGCTTGTC-3' (SEQ ID NO:1954)

Vector Primers
    5'-GTAAAACGACGGCCAG-3' (SEQ ID NO:1955)
    5'-CAGGAAACAGCTATGAC-3' (SEQ ID NO:1956)

Results 

To establish the PCR error rate in these experiments, five individual clones of SEQ
ID NO:649, prepared from five separate PCR reaction mixtures from H. pylori strain J99,
were sequenced over a total length of 897 nucleotides for a cumulative total of 4485 bases
of DNA sequence. DNA sequence for the five clones was compared to the DNA sequence
of SEQ ID NO:649 obtained previously by a different method, i.e., random shotgun cloning
and sequencing. The PCR error rate for the experiments described herein was determined
to be 2 base changes out of 4485 bases, which is equivalent to an estimated error rate of
approximately 0.04%.

DNA sequence analysis was performed on four different open reading frames
identified as genes and amplified by PCR methods from a dozen different strains of the
bacterium Helicobacter pylori. The deduced amino acid sequences of three of the four
open reading frames that were selected for this study showed statistically significant
BLAST homology to defined proteins present in other bacterial species. Those ORFs
included: SEQ ID NO:649, homologous to the val A & B genes encoding an ABC
transporter in F. novicida; SEQ ID NO:865, homologous to lipoprotein e (P4) present in the
outer membrane of H. influenzae; SEQ ID NO:677, homologous to fecA, an outer
membrane receptor in iron (III) dicitrate transport in E. coli. SEQ ID NO:764 was
identified as an unknown open reading frame, because it showed low homology with
sequences in the public databases.

To assess the extent of conservation or variance in the ORFs across various strains
of H. pylori, changes in DNA sequence and the deduced protein sequence were compared
to the DNA and deduced protein sequences found in the J99 strain of H. pylori (see Table
10 below). Results are presented as percent identity to the J99 strain of H. pylori sequenced
by random shotgun cloning. To control for any variations in the J99 sequence, each of the
four open reading frames was cloned and sequenced again from the J99 bacterial strain and
that sequence information was compared to the sequence information that had been
collected from inserts cloned by random shotgun sequencing of the J99 strain. The data
demonstrate that there is variation in the DNA sequence ranging from as little as 0.12%
difference (SEQ ID NO:764, J99 strain) to approximately 7% change (SEQ ID NO:649,
strain AH5). The deduced protein sequences show either no variation (SEQ ID NO:764,
strains AH18 and AH24) or up to as much as 7.66% amino acid changes (SEQ ID NO:649,
strain AH5).

Table 10

Multiple Strain DNA Sequence analysis of H. pylori Vaccine Candidates

J99 Seq. ID#:     649       649       865       865       677       677       764       764
Length of Region  248 a.a.  746 nt    232 a.a.  696 nt    182 a.a.  548 nt    273 a.a.  819 nt
Sequenced:

Strain Tested     AA        Nuc.      AA        Nuc.      AA        Nuc.      AA        Nuc.
                  identity  identity  identity  identity  identity  identity  identity  identity
J99               100.00%   100.00%   100.00%   100.00%   100.00%   100.00%   99.63%    99.88%
AH244             95.16%    95.04%    n.d.      n.d.      99.09%    96.71%    98.90%    96.45%
AH4               95.97%    95.98%    97.84%    95.83%    n.d.      n.d.      97.80%    95.73%
AH5               92.34%    93.03%    98.28%    96.12%    98.91%    96.90%    98.53%    95.73%
AH15              95.16%    94.91%    97.41%    95.98%    99.82%    97.99%    99.63%    96.09%
AH61              n.d.      n.d.      97.84%    95.98%    99.27%    97.44%    n.d.      n.d.
5155              n.d.      n.d.      n.d.      n.d.      99.45%    97.08%    98.53%    95.60%
5294              94.35%    94.37%    98.28%    95.40%    99.64%    97.26%    97.07%    95.48%
7958              94.35%    94.10%    97.84%    95.40%    n.d.      n.d.      99.63%    96.46%
5640              95.16%    94.37%    97.41%    95.69%    99.09%    97.63%    98.53%    95.48%
AH18              n.d.      n.d.      98.71%    95.69%    99.64%    97.44%    100.00%   95.97%
AH24              94.75%    95.04%    97.84%    95.40%    99.27%    96.71%    100.00%   96.46%

n.d. = not done
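
The percentages in this section are simple ratios, and the sketch below shows one way they could be reproduced. It assumes the two sequences have already been aligned to equal length without gaps, which the text does not state, and the function names are illustrative. With 2 changes in 5 x 897 = 4485 bases the error rate works out to roughly 0.045%, and a single mismatch over the 819 nt region of SEQ ID NO:764 corresponds to the 0.12% difference (99.88% identity) reported for the J99 re-sequencing.

```python
def percent_error(changes, total_bases):
    """Observed base changes as a percentage of all bases sequenced."""
    if total_bases <= 0:
        raise ValueError("total_bases must be positive")
    return 100.0 * changes / total_bases


def percent_identity(reference, other):
    """Percent of positions that match between two pre-aligned sequences.

    Assumes equal length and no gaps (an assumption of this sketch).
    """
    if not reference or len(reference) != len(other):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(reference, other))
    return 100.0 * matches / len(reference)


# PCR error rate: five clones of 897 nucleotides each, 2 changes in total.
pcr_error = percent_error(2, 5 * 897)

# Synthetic placeholder sequences (not real H. pylori data): one mismatch
# in an 819 nt region.
identity_764 = percent_identity("A" * 819, "A" * 818 + "C")
```

The same function applies to deduced protein sequences; for example, 19 differing residues in a 248 a.a. region give the 92.34% identity listed for SEQ ID NO:649 in strain AH5.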

















VI. Experimental Knock-Out Protocol for the Determination of Essential H. pylori Genes
as Potential Therapeutic Targets

Therapeutic targets are chosen from genes whose protein products appear to play
key roles in essential cell pathways such as cell envelope synthesis, DNA synthesis,
transcription, translation, regulation and colonization/virulence.

The protocol for the deletion of portions of H. pylori genes/ORFs and the
insertional mutagenesis of a kanamycin-resistance cassette in order to identify genes which
are essential to the cell is modified from previously published methods (Labigne-Roussel et
al., 1988, J. Bacteriology 170, pp. 1704-1708; Cover et al., 1994, J. Biological Chemistry
269, pp. 10566-10573; Reyrat et al., 1995, Proc. Natl. Acad. Sci. 92, pp. 8768-8772). The
result is a gene "knock-out".

Identification and Cloning of H. pylori Gene Sequences

The sequences of the genes or ORFs (open reading frames) selected as knock-out
targets are identified from the H. pylori genomic sequence and used to design primers to
specifically amplify the genes/ORFs. All synthetic oligonucleotide primers are designed
with the aid of the OLIGO program (National Biosciences, Inc., Plymouth, MN 55447,
USA), and can be purchased from Gibco/BRL Life Technologies (Gaithersburg, MD,
USA). If the ORF is smaller than 800 to 1000 base pairs, flanking primers are chosen
outside of the open reading frame.

Genomic DNA prepared from the Helicobacter pylori HpJ99 strain (ATCC 55679)
is used as the source of template DNA for amplification of the ORFs by PCR (polymerase
chain reaction) (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., editors, 1994). For the preparation of genomic DNA from H. pylori, see
Example I. PCR amplification is carried out by introducing 10 nanograms of genomic
HpJ99 DNA into a reaction vial containing 10 mM Tris pH 8.3, 50 mM KCl, 2 mM MgCl2,
2 micromolar synthetic oligonucleotide primers (forward = F1 and reverse = R1), 0.2 mM of
each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP), and 1.25 units of heat
stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA)
in a final volume of 40 microliters. The PCR is carried out with Perkin Elmer
Cetus/GeneAmp PCR System 9600 thermal cyclers.

Upon completion of thermal cycling reactions, each sample of amplified DNA is
visualized on a 2% TAE agarose gel stained with Ethidium Bromide (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) to
determine that a single product of the expected size has resulted from the reaction.
Amplified DNA is then washed and purified using the Qiaquick Spin PCR purification kit
(Qiagen, Gaithersburg, MD, USA).

PCR products are cloned into the pT7Blue T-Vector (catalog #69820-1, Novagen,
Inc., Madison, WI, USA) using the TA cloning strategy (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The ligation of the
PCR product into the vector is accomplished by mixing a 6 fold molar excess of the PCR
product, 10 ng of pT7Blue-T vector (Novagen), 1 microliter of T4 DNA Ligase Buffer
(New England Biolabs, Beverly, MA, USA), and 200 units of T4 DNA Ligase (New
England Biolabs) into a final reaction volume of 10 microliters. Ligation is allowed to
proceed for 16 hours at 16°C.
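
A "6 fold molar excess" of PCR product over 10 ng of vector translates into a mass of insert that depends on the two fragment lengths. The sketch below shows the conversion, assuming both molecules are double-stranded DNA so that mass is proportional to length. The function name is illustrative, and the vector length must be taken from the supplier's map; the 2900 bp and 1000 bp figures in the example call are placeholders, not values from the text.

```python
def insert_mass_ng(vector_ng, vector_bp, insert_bp, molar_excess):
    """Mass of insert (ng) giving the requested insert:vector molar ratio.

    For double-stranded DNA, moles are proportional to mass / length, so
    insert_ng = vector_ng * molar_excess * insert_bp / vector_bp.
    """
    if min(vector_ng, vector_bp, insert_bp, molar_excess) <= 0:
        raise ValueError("all arguments must be positive")
    return vector_ng * molar_excess * insert_bp / vector_bp


# Placeholder lengths for illustration only.
example_ng = insert_mass_ng(vector_ng=10, vector_bp=2900,
                            insert_bp=1000, molar_excess=6)
```

Because the required mass scales with insert length, a short ORF needs proportionally less PCR product than a long one to reach the same molar excess.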

Ligation products are electroporated (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) into electroporation-competent XL-1
Blue or DH5-α E. coli cells (Clontech Lab., Inc., Palo Alto, CA, USA). Briefly, 1 microliter
of ligation reaction is mixed with 40 microliters of electrocompetent cells and subjected to
a high voltage pulse (25 microFarads, 2.5 kV, 200 ohms), after which the samples are
incubated in 0.45 ml SOC medium (0.5% yeast extract, 2% tryptone, 10 mM NaCl, 2.5 mM
KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM glucose) at 37°C with shaking for 1 hour.
Samples are then spread onto LB (10 g/l bacto tryptone, 5 g/l bacto yeast extract, 10 g/l
sodium chloride) plates containing 100 microgram/ml of Ampicillin, 0.3% X-gal, and 100
microgram/ml IPTG. These plates are incubated overnight at 37°C. Ampicillin-resistant
colonies with white color are selected, grown in 5 ml of liquid LB containing 100
microgram/ml of Ampicillin, and plasmid DNA is isolated using the Qiagen miniprep
protocol (Qiagen, Gaithersburg, MD, USA).

To verify that the correct H. pylori DNA inserts have been cloned, these pT7Blue
plasmid DNAs are used as templates for PCR amplification of the cloned inserts, using the
same forward and reverse primers used for the initial amplification of the J99 H. pylori
sequence. Recognition of the primers and a PCR product of the correct size as visualized
on a 2% TAE, ethidium bromide stained agarose gel are confirmation that the correct
inserts have been cloned. Two to six such verified clones are obtained for each knock-out
target, and frozen at -70°C for storage. To minimize errors due to PCR, plasmid DNA from
these verified clones is pooled, and used in subsequent cloning steps.

The sequences of the genes/ORFs are again used to design a second pair of primers
which flank the region of H. pylori DNA to be either interrupted or deleted (up to 250
basepairs) within the ORFs but are oriented away from each other. The pool of circular
plasmid DNAs of the previously isolated clones is used as template for this round of
PCR. Since the orientation of amplification of this pair of deletion primers is away from
each other, the portion of the ORF between the primers is not included in the resultant PCR
product. The PCR product is a linear piece of DNA with H. pylori DNA at each end and
the pT7Blue vector backbone between them which, in essence, results in the deletion of a
portion of the ORFs. The PCR product is visualized on a 1% TAE, ethidium bromide
stained agarose gel to confirm that only a single product of the correct size has been
amplified.
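
The geometry of the outward-facing deletion primers can be made concrete with a small sketch. It is an illustration only: the function names, the 0-based coordinates, and the fixed primer length are assumptions, and real primer design (as done in the text with the OLIGO program) would also weigh melting temperature and secondary structure. The sketch returns one primer that reads away from the deletion toward the 5' end of the insert and one that reads away toward the 3' end, and computes the length of the resulting linear product.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def outward_primers(insert_seq, del_start, del_end, primer_len=20):
    """Return (upstream_primer, downstream_primer) flanking a deletion.

    insert_seq[del_start:del_end] is the region to be removed (0-based,
    end-exclusive). Both primers point away from that region.
    """
    if not (primer_len <= del_start < del_end <= len(insert_seq) - primer_len):
        raise ValueError("deletion must leave room for a primer on each side")
    upstream = reverse_complement(insert_seq[del_start - primer_len:del_start])
    downstream = insert_seq[del_end:del_end + primer_len].upper()
    return upstream, downstream


def linear_product_bp(vector_bp, insert_bp, del_start, del_end):
    """Length of the linear product: whole plasmid minus the deleted region."""
    return vector_bp + insert_bp - (del_end - del_start)
```

Because the template is circular, extension from the two primers proceeds around the vector backbone, so everything except the region between the primers ends up in the product.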

A Kanamycin-resistance cassette (Labigne-Roussel et al., 1988, J. Bacteriology 170,
1704-1708) is ligated to this PCR product by the TA cloning method used previously
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al.,
editors, 1994). The Kanamycin cassette containing a Campylobacter kanamycin resistance
gene is obtained by carrying out an EcoRI digestion of the recombinant plasmid
pCTB8:fcm (Cover et al., 1994, J. Biological Chemistry 269, pp. 10566-10573). The proper
fragment (1.4 kb) is separated on a 1% TAE gel, and isolated using the QIAquick gel
extraction kit (Qiagen, Gaithersburg, MD, USA). The fragment is end repaired using the
Klenow fill-in protocol, which involves mixing 4 µg of the DNA fragment, 1 microliter of
dATP, dGTP, dCTP, dTTP at 0.5 mM, 2 microliters of Klenow Buffer (New England
Biolabs) and 5 units of DNA Polymerase I Large (Klenow) Fragment (New
England Biolabs) into a 20 microliter reaction, incubating at 30°C for 15 min, and
inactivating the enzyme by heating to 75°C for 10 minutes. This blunt-ended Kanamycin
cassette is then purified through a Qiaquick column (Qiagen, Gaithersburg, MD, USA) to
eliminate nucleotides. The "T" overhang is then generated by mixing 5 micrograms of the
blunt-ended kanamycin cassette, 10 mM Tris pH 8.3, 50 mM KCl, 2 mM MgCl2, 5 units of
DNA Polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA), and 20
microliters of 5 mM dTTP in a 100 microliter reaction and incubating the reaction for 2
hours at 37°C. The "Kan-T" cassette is purified using a QIAquick column (Qiagen,
Gaithersburg, MD, USA). The PCR product of the deletion primers (F2 and R2) is ligated
to the Kan-T cassette by mixing 10 to 25 ng of deletion primer PCR product, 50 - 75 ng
Kan-T cassette DNA, 1 microliter 10x T4 DNA Ligase reaction mixture, and 0.5 microliter T4
DNA Ligase (New England Biolabs, Beverly, MA, USA) in a 10 microliter reaction and
incubating for 16 hours at 16°C.
The ligation products are transformed into XL-1 Blue or DH5-α E. coli cells by
electroporation as described previously. After recovery in SOC, cells are plated onto LB
plates containing 100 microgram/ml Ampicillin and grown overnight at 37°C. These plates
are then replica plated onto plates containing 25 microgram/ml Kanamycin and allowed to
grow overnight. Resultant colonies have both the Ampicillin resistance gene present in the
pT7Blue vector, and the newly introduced Kanamycin resistance gene. Colonies are picked
into LB containing 25 microgram/ml Kanamycin and plasmid DNA is isolated from the
cultured cells using the Qiagen miniprep protocol (Qiagen, Gaithersburg, MD, USA).

Several tests by PCR amplification are conducted on these plasmids to verify that
the Kanamycin cassette is inserted in the H. pylori gene/ORF, and to determine the orientation of
the insertion of the Kanamycin-resistance gene relative to the H. pylori gene/ORF. To
verify that the Kanamycin cassette is inserted into the H. pylori sequence, the plasmid
DNAs are used as templates for PCR amplification with the set of primers originally used
to clone the H. pylori gene/ORFs. The correct PCR product is the size of the deleted
gene/ORF but increased in size by the addition of a 1.4 kilobase Kanamycin cassette. To
avoid potential polar effects of the kanamycin resistance cassette on H. pylori gene
expression, the orientation of the Kanamycin resistance gene with respect to the knock-out
gene/ORF is determined and both orientations are eventually used in H. pylori
transformations (see below). To determine the orientation of insertion of the kanamycin
resistance gene, primers are designed from the ends of the kanamycin resistance gene
("Kan-1": 5'-ATCTTACCTATCACCTCAAAT-3', and "Kan-2": 5'-
AGACAGCAACATCTTTGTGAA-3'). By using each of the cloning primers in
conjunction with each of the Kan primers (4 combinations of primers), the orientation of
the Kanamycin cassette relative to the H. pylori sequence is determined. Positive clones
are classified as either in the "A" orientation (the same direction of transcription is present
for both the H. pylori gene and the Kanamycin resistance gene), or in the "B" orientation
(the direction of transcription for the H. pylori gene is opposite to that of the Kanamycin
resistance gene). Clones which share the same orientation (A or B) are pooled for
subsequent experiments and independently transformed into H. pylori.

Transformation of Plasmid DNA into H. pylori cells

Two strains of H. pylori are used for transformation: ATCC 55679, the clinical
isolate which provided the DNA from which the H. pylori sequence database is obtained,
and AH244, an isolate which had been passaged in, and has the ability to colonize, the
mouse stomach. Cells for transformation are grown at 37°C, 10% CO2, 100% humidity,
either on Sheep-Blood agar plates or in Brucella Broth liquid. Cells are grown to
exponential phase, and examined microscopically to determine that the cells are "healthy"
(actively moving cells) and not contaminated. If grown on plates, cells are harvested by
scraping cells from the plate with a sterile loop, suspended in 1 ml of Brucella Broth, spun
down (1 minute, top speed in eppendorf microfuge) and resuspended in 200 microliters
Brucella Broth. If grown in Brucella Broth liquid, cells are centrifuged (15 minutes at 3000
rpm in a Beckman TJ6 centrifuge) and the cell pellet resuspended in 200 microliters of
Brucella broth. An aliquot of cells is taken to determine the optical density at 600 nm, in
order to calculate the concentration of cells. An aliquot (1 to 5 OD600 units/25 microliter)
of the resuspended cells is placed onto a prewarmed Sheep-Blood agar plate, and the plate
is further incubated at 37°C, 6% CO2, 100% humidity for 4 hours. After this incubation,
10 microliters of plasmid DNA (100 micrograms per microliter) is spotted onto these cells.
A positive control (plasmid DNA with the ribonuclease H gene disrupted by kanamycin
resistance gene) and a negative control (no plasmid DNA) are done in parallel. The plates
are returned to 37°C, 6% CO2 for an additional 4 hours of incubation. Cells are then
spread onto that plate using a swab wetted in Brucella broth, and grown for 20 hours at
37°C, 6% CO2. Cells are then transferred to a Sheep-Blood agar plate containing 25
micrograms/ml Kanamycin, and allowed to grow for 3 to 5 days at 37°C, 6% CO2, 100%
humidity. If colonies appear, they are picked and regrown as patches on a fresh Sheep-
Blood agar plate containing 25 micrograms/ml Kanamycin.

Three sets of PCR tests are done to verify that the colonies of transformants have
arisen from homologous recombination at the proper chromosomal location. The template
for PCR (DNA from the colony) is obtained by a rapid boiling DNA preparation method as
follows. An aliquot of the colony (stab of the colony with a toothpick) is introduced into
100 microliters of 1% Triton X-100, 20 mM Tris, pH 8.5, and boiled for 6 minutes. An
equal volume of phenol:chloroform (1:1) is added and vortexed. The mixture is
microfuged for 5 minutes and the supernatant is used as DNA template for PCR with
combinations of the following primers to verify homologous recombination at the proper
chromosomal location.

TEST 1. PCR with cloning primers originally used to amplify the gene/ORF. A
positive result of homologous recombination at the correct chromosomal location should
show a single PCR product whose size is expected to be the size of the deleted gene/ORF
but increased in size by the addition of a 1.4 kilobase Kanamycin cassette. A PCR product
of just the size of the gene/ORF is proof that the gene has not been knocked out and that
the transformant is not the result of homologous recombination at the correct chromosomal
location.

TEST 2. PCR with F3 (primer designed from sequences upstream of the gene/ORF
and not present on the plasmid), and either primer Kan-1 or Kan-2 (primers designed from
the ends of the kanamycin resistance gene), depending on whether the plasmid DNA used
was of "A" or "B" orientation. Homologous recombination at the correct chromosomal
location will result in a single PCR product of the expected size (i.e., from the location of
F3 to the insertion site of kanamycin resistance gene). No PCR product or PCR product(s)
of incorrect size(s) will prove that the plasmid has not integrated at the correct site and that
the gene has not been knocked out.

TEST 3. PCR with R3 (primer designed from sequences downstream of the
gene/ORF and not present on the plasmid) and either primer Kan-1 or Kan-2, depending on
whether the plasmid DNA used was of "A" or "B" orientation. Homologous recombination
at the correct chromosomal location will result in a single PCR product of the expected size
(i.e., from the insertion site of kanamycin resistance gene to the downstream location of
R3). Again, no PCR product or PCR product(s) of incorrect size(s) will prove that the
plasmid had not integrated at the correct site and that the gene had not been knocked out.

Transformants showing positive results for all three tests above indicate that the 
gene is not essential for survival in vitro. 

A negative result in any of the three above tests for each transformant indicates that
the gene had not been disrupted, and that the gene is essential for survival in vitro.
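
The interpretation of TEST 1 through TEST 3 can be sketched in a few lines of Python. This is only an illustrative sketch, not part of the described method: the function names and the `tolerance_bp` allowance for gel sizing error are hypothetical, and the only figure taken from the text is the 1.4 kilobase size of the Kanamycin cassette.

```python
KAN_CASSETTE_BP = 1400  # 1.4 kilobase Kanamycin cassette, as stated in TEST 1


def expected_test1_size(orf_amplicon_bp: int) -> int:
    """Expected TEST 1 product size: gene/ORF amplicon plus the cassette."""
    return orf_amplicon_bp + KAN_CASSETTE_BP


def is_test1_positive(observed_bp: int, orf_amplicon_bp: int,
                      tolerance_bp: int = 100) -> bool:
    """True if the observed band matches the expected disrupted size.

    tolerance_bp is a hypothetical allowance for gel sizing error.
    """
    expected = expected_test1_size(orf_amplicon_bp)
    return abs(observed_bp - expected) <= tolerance_bp


def classify_transformant(t1_ok: bool, t2_ok: bool, t3_ok: bool) -> str:
    """All three tests positive: disrupted, so not essential in vitro.
    Any negative test: the gene was not disrupted at the correct location."""
    if t1_ok and t2_ok and t3_ok:
        return "disrupted; gene not essential in vitro"
    return "not disrupted at the correct location"
```

For example, a gene/ORF amplicon of 900 base pairs would be expected to give a TEST 1 product of about 2300 base pairs after correct insertion of the cassette.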

In the event that no colonies result from two independent transformations while the
positive control with the disrupted ribonuclease H plasmid DNA produces transformants,
the plasmid DNA is further analyzed by PCR on DNA from transformant populations prior
to plating for colony formation. This will verify that the plasmid can enter the cells and
undergo homologous recombination at the correct site. Briefly, plasmid DNA is incubated
according to the transformation protocol described above. DNA is extracted from the H.
pylori cells immediately after incubation with the plasmid DNAs and the DNA is used as
template for the above TEST 2 and TEST 3. Positive results in TEST 2 and TEST 3 would
verify that the plasmid DNA could enter the cells and undergo homologous recombination
at the correct chromosomal location. If TEST 2 and TEST 3 are positive, then failure to
obtain viable transformants indicates that the gene is essential, and cells suffering a
disruption in that gene are incapable of colony formation.
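
The reasoning for the no-colony case can be written out as a short decision function. This is a sketch under stated assumptions: the function name and the "inconclusive" outcomes are hypothetical labels for cases the text does not address, and only the "essential" branch is stated in the text.

```python
def interpret_no_colonies(positive_control_grew: bool,
                          t2_positive: bool,
                          t3_positive: bool) -> str:
    """Interpret a knockout attempt that produced no colonies.

    positive_control_grew: the disrupted ribonuclease H control gave
    transformants, so the transformation itself worked.
    t2_positive / t3_positive: TEST 2 and TEST 3 results on DNA taken
    from the transformant population before plating.
    """
    if not positive_control_grew:
        # Assumption: without a working control nothing can be concluded.
        return "inconclusive"
    if t2_positive and t3_positive:
        # Stated in the text: integration occurred, yet no viable colonies.
        return "essential"
    # Assumption: integration at the correct site was not demonstrated.
    return "inconclusive"
```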

VII. High-throughput drug screen assay

Cloning, expression and protein purification 

Cloning, transformation, expression and purification of the H. pylori target gene and
its protein product, e.g., an H. pylori enzyme, to be used in a high-throughput drug screen
assay, is carried out essentially as described in Examples II and III above. Development
and application of a screening assay for a particular H. pylori gene product, peptidyl-prolyl
cis-trans isomerase (PPIase), is described below as a specific example.

Enzymatic Assay

The assay is essentially as described by Fischer (Fischer, G., et al. (1984) Biomed.
Biochim. Acta 43: 1101-1111). The assay measures the cis-trans isomerization of the Ala-Pro
bond in the test peptide N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide (Sigma # S-7388,
lot # 84H5805). The assay is coupled with α-chymotrypsin, where the protease can
cleave the test peptide only when the Ala-Pro bond is in trans. The
conversion of the test peptide to the trans isomer in the assay is followed at 390 nm on a
Beckman Model DU-650 spectrophotometer. The data are collected every second with an
average scanning time of 0.5 second. Assays are carried out in 35 mM Hepes, pH 8.0, in
a final volume of 400 µl, with 10 µM α-chymotrypsin (type I-S from bovine pancreas,
Sigma # C-7762, lot 23H7020) and 10 nM PPIase. To initiate the reaction, 10 µl of the
substrate (2 mM N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide in DMSO) is added to 390 µl
of reaction mixture at room temperature.
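
As a check on the assay composition, the final substrate concentration follows from simple dilution arithmetic (C1 x V1 = C2 x V2). The sketch below uses only the volumes and the stock concentration given above; the function name is hypothetical.

```python
def final_concentration(stock_conc: float, added_volume_ul: float,
                        final_volume_ul: float) -> float:
    """Concentration after dilution, in the same units as stock_conc."""
    return stock_conc * added_volume_ul / final_volume_ul


# 10 µl of 2 mM substrate stock brought to a final volume of 400 µl
substrate_final_mM = final_concentration(2.0, 10.0, 400.0)  # 0.05 mM, i.e. 50 µM

# The substrate stock is in DMSO, so DMSO makes up 10 µl of the 400 µl
dmso_fraction = 10.0 / 400.0  # 0.025, i.e. 2.5% by volume
```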

Enzymatic assay in crude bacterial extract

A 50 ml culture of Helicobacter pylori (strain J99) in Brucella broth is harvested at
mid-log phase (OD 600 nm ~ *) and resuspended in lysis buffer with the following protease
inhibitors: 1 mM PMSF, and 10 µg/ml of each of aprotinin, leupeptin, pepstatin, TLCK,
TPCK, and soybean trypsin inhibitor. The suspension is subjected to 3 cycles of freeze-thaw
(15 minutes at -70°C, then 30 minutes at room temperature), followed by sonication
(three 20 second bursts). The lysate is centrifuged (12,000 x g for 30 minutes) and the
supernatant is assayed for enzymatic activity as described above.

Many H. pylori enzymes can be expressed at high levels and in an active form in E.
coli. Such high yields of purified proteins provide for the design of various high
throughput drug screening assays.

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments and methods
described herein. Such equivalents are intended to be encompassed by the scope of the
following claims.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: 

(A) NAME: Astra Aktiebolag 

(B) STREET: S-151 85 

(C) CITY: Sodertalje 
(D) STATE:

(E) COUNTRY: Sweden 

(F) POSTAL CODE (ZIP) : 

(ii) TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES 

RELATING TO HELICOBACTER PYLORI FOR 
DIAGNOSTICS AND THERAPEUTICS 

(iii) NUMBER OF SEQUENCES: 1956 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 8-mm cartridge tape

(B) COMPUTER: SPARC station LX

(C) OPERATING SYSTEM: SunOS Release 4.1.3

(D) SOFTWARE: tar

(v) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: PCT/US96/09122 
(B) FILING DATE: June 6, 1996 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/630,405 

(B) FILING DATE: 01-APR-1996

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/561,469 

(B) FILING DATE: 17-NOV-1995 

(viii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/487,032 

(B) FILING DATE: 07-JUNE-1995 

(ix) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 60 State Street, Suite 510 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02109-1875 

(x) ATTORNEY/AGENT INFORMATION:

(A) NAME: Mandragouras, Amy E.

(B) REGISTRATION NUMBER: 36,207 

(C) REFERENCE/DOCKET NUMBER: GTN-001C5PC 

(xi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)227-5941

(B) LOCATION 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1956
CAGGAAACAG CTATGAC 17

1. An isolated nucleic acid comprising a nucleotide sequence encoding an H.
pylori cell envelope polypeptide or a fragment thereof, said nucleic acid selected from the
group consisting of SEQ ID NO: 1020, SEQ ID NO: 1021, SEQ ID NO: 1036, SEQ ID
NO: 1050, SEQ ID NO: 1071, SEQ ID NO: 1101, SEQ ID NO: 1135, SEQ ID NO: 1276,
SEQ ID NO: 1150, SEQ ID NO: 1187, SEQ ID NO: 1192, SEQ ID NO: 1361, SEQ ID
NO: 1379, SEQ ID NO: 1399, SEQ ID NO: 1403, SEQ ID NO: 1400, SEQ ID NO: 1189,
SEQ ID NO: 1002, SEQ ID NO: 1213, SEQ ID NO: 1214, SEQ ID NO: 1215, SEQ ID
NO: 1234, SEQ ID NO: 1236, SEQ ID NO: 1237, SEQ ID NO: 1224, SEQ ID NO: 1251,
SEQ ID NO: 1262, SEQ ID NO: 1149, SEQ ID NO: 1220, SEQ ID NO: 1240, SEQ ID
NO: 1164, SEQ ID NO: 1165, SEQ ID NO: 1404, SEQ ID NO: 1144, SEQ ID NO: 1182,
SEQ ID NO: 1157, SEQ ID NO: 1160, SEQ ID NO: 1300, SEQ ID NO: 1321, SEQ ID
NO: 1323, SEQ ID NO: 1329, SEQ ID NO: 1332, SEQ ID NO: 1345, SEQ ID NO: 1358,
SEQ ID NO: 1375, SEQ ID NO: 1417, SEQ ID NO: 1283, SEQ ID NO: 1335, SEQ ID
NO: 1368, SEQ ID NO: 1179, SEQ ID NO: 1255, SEQ ID NO: 1258, SEQ ID NO: 1044,
SEQ ID NO: 1273, SEQ ID NO: 1219, SEQ ID NO: 1274, SEQ ID NO: 1210, SEQ ID
NO: 1422, SEQ ID NO: 1302, SEQ ID NO: 1308, SEQ ID NO: 1310, SEQ ID NO: 1331,
SEQ ID NO: 1432, SEQ ID NO: 1052, SEQ ID NO: 1091, SEQ ID NO: 1421, SEQ ID
NO: 1069, SEQ ID NO: 1005, SEQ ID NO: 1007, SEQ ID NO: 1166, SEQ ID NO: 1177,
SEQ ID NO: 1193, SEQ ID NO: 1206, SEQ ID NO: 1207, SEQ ID NO: 1304, SEQ ID
NO: 1305, SEQ ID NO: 1346, SEQ ID NO: 1348, SEQ ID NO: 1350, SEQ ID NO: 1032,
SEQ ID NO: 1053, SEQ ID NO: 1081, SEQ ID NO: 1124, SEQ ID NO: 1382, SEQ ID
NO: 1437, SEQ ID NO: 1263, SEQ ID NO: 1173, SEQ ID NO: 1405, SEQ ID NO: 1406,
SEQ ID NO: 1410, SEQ ID NO: 1086, SEQ ID NO: 1322, SEQ ID NO: 1266, SEQ ID
NO: 1282, SEQ ID NO: 1271, SEQ ID NO: 1208, SEQ ID NO: 1126, SEQ ID NO: 1270,
SEQ ID NO: 1278, SEQ ID NO: 1419, SEQ ID NO: 1125, SEQ ID NO: 1181, SEQ ID
NO: 1416, SEQ ID NO: 1096, SEQ ID NO: 1082, SEQ ID NO: 1146, SEQ ID NO: 1145,
SEQ ID NO: 1108, SEQ ID NO: 1148, SEQ ID NO: 1337, SEQ ID NO: 1338, SEQ ID
NO: 1424, SEQ ID NO: 1000, SEQ ID NO: 1027, SEQ ID NO: 1175, SEQ ID NO: 1330,
SEQ ID NO: 217, SEQ ID NO: 217, SEQ ID NO: 367, SEQ ID NO: 911, SEQ ID NO: 944,
SEQ ID NO: 18, SEQ ID NO: 107, SEQ ID NO: 894, SEQ ID NO: 943, SEQ ID NO: 203,
SEQ ID NO: 85, SEQ ID NO: 290, SEQ ID NO: 5, SEQ ID NO: 199, SEQ ID NO: 992,
SEQ ID NO: 934, SEQ ID NO: 899, SEQ ID NO: 302, SEQ ID NO: 215, SEQ ID NO:
893, SEQ ID NO: 984, SEQ ID NO: 97, SEQ ID NO: 22, SEQ ID NO: 49, SEQ ID NO:
309, SEQ ID NO: 150, SEQ ID NO: 240, SEQ ID NO: 957, SEQ ID NO: 57, SEQ ID NO:
2, SEQ ID NO: 92, SEQ ID NO: 255, SEQ ID NO: 164, SEQ ID NO: 201, SEQ ID NO:
278, SEQ ID NO: 245, SEQ ID NO: 921, SEQ ID NO: 896, SEQ ID NO: 248, SEQ ID
NO: 159, SEQ ID NO: 979, SEQ ID NO: 194, SEQ ID NO: 194, SEQ ID NO: 946, SEQ
ID NO: 916, SEQ ID NO: 76, SEQ ID NO: 905, SEQ ID NO: 914, SEQ ID NO: 931, SEQ
ID NO: 50, SEQ ID NO: 250, SEQ ID NO: 969, SEQ ID NO: 66, SEQ ID NO: 275, SEQ
ID NO: 330, SEQ ID NO: 204, SEQ ID NO: 383, SEQ ID NO: 303, SEQ ID NO: 70, SEQ
ID NO: 983, SEQ ID NO: 972, SEQ ID NO: 929, SEQ ID NO: 972, SEQ ID NO: 936,
SEQ ID NO: 267, SEQ ID NO: 197, SEQ ID NO: 55, SEQ ID NO: 54, SEQ ID NO: 210,
SEQ ID NO: 90, SEQ ID NO: 15, SEQ ID NO: 913, SEQ ID NO: 227, SEQ ID NO: 79,
SEQ ID NO: 191, SEQ ID NO: 238, SEQ ID NO: 274, SEQ ID NO: 27, SEQ ID NO: 258,
SEQ ID NO: 295, SEQ ID NO: 10, SEQ ID NO: 160, SEQ ID NO: 225, SEQ ID NO: 964,
SEQ ID NO: 166, SEQ ID NO: 56, SEQ ID NO: 980, SEQ ID NO: 903, SEQ ID NO: 261,
SEQ ID NO: 71, SEQ ID NO: 955, SEQ ID NO: 361, SEQ ID NO: 58, SEQ ID NO: 114,
SEQ ID NO: 940, SEQ ID NO: 960, SEQ ID NO: 144, SEQ ID NO: 362, SEQ ID NO: 40,
SEQ ID NO: 285, SEQ ID NO: 11, SEQ ID NO: 161, SEQ ID NO: 974, SEQ ID NO: 111,
SEQ ID NO: 316, SEQ ID NO: 257, SEQ ID NO: 78, SEQ ID NO: 966, SEQ ID NO: 352,
SEQ ID NO: 981, SEQ ID NO: 158, SEQ ID NO: 989, SEQ ID NO: 963, SEQ ID NO: 48,
SEQ ID NO: 68, SEQ ID NO: 135, SEQ ID NO: 910, SEQ ID NO: 236, SEQ ID NO: 241,
SEQ ID NO: 949, SEQ ID NO: 945, SEQ ID NO: 207, SEQ ID NO: 977, SEQ ID NO:
978, SEQ ID NO: 994, SEQ ID NO: 163, SEQ ID NO: 256, SEQ ID NO: 287, SEQ ID
NO: 184, SEQ ID NO: 45, SEQ ID NO: 136, SEQ ID NO: 214, SEQ ID NO: 16, SEQ ID
NO: 192, SEQ ID NO: 373, SEQ ID NO: 892, SEQ ID NO: 239, SEQ ID NO: 34, SEQ ID
NO: 340, SEQ ID NO: 41, SEQ ID NO: 332, SEQ ID NO: 134, and SEQ ID NO: 330.

2. The purified nucleic acid of claim 1, wherein said H. pylori cell envelope
polypeptide or a fragment thereof is an H. pylori flagella-associated polypeptide or a
fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID
NO: 1020, SEQ ID NO: 1021, SEQ ID NO: 1036, SEQ ID NO: 1050, SEQ ID NO: 1071,
SEQ ID NO: 1101, SEQ ID NO: 1135, SEQ ID NO: 1276, SEQ ID NO: 1150, SEQ ID
NO: 1187, SEQ ID NO: 1192, SEQ ID NO: 1361, SEQ ID NO: 1379, SEQ ID NO: 1399,
SEQ ID NO: 1403, SEQ ID NO: 1400, SEQ ID NO: 1189, SEQ ID NO: 217, SEQ ID NO:
367, SEQ ID NO: 911, SEQ ID NO: 944, SEQ ID NO: 18, SEQ ID NO: 107, SEQ ID NO:
894, SEQ ID NO: 943, SEQ ID NO: 203, SEQ ID NO: 85, SEQ ID NO: 290, SEQ ID NO:
5, SEQ ID NO: 199, SEQ ID NO: 992, SEQ ID NO: 934, SEQ ID NO: 899, SEQ ID NO:
302, and SEQ ID NO: 215.

3. The purified nucleic acid of claim 1, wherein said H. pylori cell envelope
polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide or a fragment
thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO:
1002, SEQ ID NO: 1213, SEQ ID NO: 1214, SEQ ID NO: 1215, SEQ ID NO: 1234, SEQ
ID NO: 1236, SEQ ID NO: 1237, SEQ ID NO: 1224, SEQ ID NO: 1251, SEQ ID NO:
1262, SEQ ID NO: 1149, SEQ ID NO: 1220, SEQ ID NO: 1240, SEQ ID NO: 1164, SEQ
ID NO: 1165, SEQ ID NO: 1404, SEQ ID NO: 1144, SEQ ID NO: 1182, SEQ ID NO:
1157, SEQ ID NO: 1160, SEQ ID NO: 1300, SEQ ID NO: 1321, SEQ ID NO: 1323, SEQ
ID NO: 1329, SEQ ID NO: 1332, SEQ ID NO: 1345, SEQ ID NO: 1358, SEQ ID NO:
1375, SEQ ID NO: 1417, SEQ ID NO: 1283, SEQ ID NO: 1335, SEQ ID NO: 1368, SEQ
ID NO: 1179, SEQ ID NO: 1255, SEQ ID NO: 1258, SEQ ID NO: 1044, SEQ ID NO:
1273, SEQ ID NO: 893, SEQ ID NO: 984, SEQ ID NO: 97, SEQ ID NO: 22, SEQ ID NO:
49, SEQ ID NO: 309, SEQ ID NO: 150, SEQ ID NO: 240, SEQ ID NO: 957, SEQ ID NO:
57, SEQ ID NO: 2, SEQ ID NO: 92, SEQ ID NO: 255, SEQ ID NO: 164, SEQ ID NO:
201, SEQ ID NO: 278, SEQ ID NO: 245, SEQ ID NO: 921, SEQ ID NO: 896, SEQ ID
NO: 248, SEQ ID NO: 159, SEQ ID NO: 979, SEQ ID NO: 194, SEQ ID NO: 194, SEQ
ID NO: 946, SEQ ID NO: 916, SEQ ID NO: 76, SEQ ID NO: 905, SEQ ID NO: 914, SEQ
ID NO: 931, SEQ ID NO: 50, SEQ ID NO: 250, SEQ ID NO: 969, SEQ ID NO: 66, SEQ
ID NO: 275, SEQ ID NO: 330, SEQ ID NO: 204, SEQ ID NO: 383, SEQ ID NO: 303,
SEQ ID NO: 70, SEQ ID NO: 983, SEQ ID NO: 972, SEQ ID NO: 929, SEQ ID NO: 972,
SEQ ID NO: 936, SEQ ID NO: 267, SEQ ID NO: 197, SEQ ID NO: 55, SEQ ID NO: 54,
and SEQ ID NO: 210.

4. The purified nucleic acid of claim 1, wherein said H. pylori cell envelope
polypeptide or a fragment thereof is an H. pylori transporter polypeptide or a fragment
thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO:
1219, SEQ ID NO: 1274, SEQ ID NO: 1210, SEQ ID NO: 1422, SEQ ID NO: 1302, SEQ
ID NO: 1308, SEQ ID NO: 1310, SEQ ID NO: 1331, SEQ ID NO: 1432, SEQ ID NO:
1052, SEQ ID NO: 1091, SEQ ID NO: 1421, SEQ ID NO: 1069, SEQ ID NO: 1005, SEQ
ID NO: 1007, SEQ ID NO: 1166, SEQ ID NO: 1177, SEQ ID NO: 1193, SEQ ID NO:
1206, SEQ ID NO: 1207, SEQ ID NO: 1304, SEQ ID NO: 1305, SEQ ID NO: 1346, SEQ
ID NO: 1348, SEQ ID NO: 1350, SEQ ID NO: 1032, SEQ ID NO: 1053, SEQ ID NO:
1081, SEQ ID NO: 1124, SEQ ID NO: 1382, SEQ ID NO: 1437, SEQ ID NO: 1263, SEQ
ID NO: 90, SEQ ID NO: 15, SEQ ID NO: 913, SEQ ID NO: 227, SEQ ID NO: 79, SEQ ID
NO: 191, SEQ ID NO: 238, SEQ ID NO: 274, SEQ ID NO: 27, SEQ ID NO: 258, SEQ ID
NO: 295, SEQ ID NO: 10, SEQ ID NO: 160, SEQ ID NO: 225, SEQ ID NO: 964, SEQ ID
NO: 166, SEQ ID NO: 56, SEQ ID NO: 980, SEQ ID NO: 903, SEQ ID NO: 261, SEQ ID
NO: 71, SEQ ID NO: 955, SEQ ID NO: 361, SEQ ID NO: 58, SEQ ID NO: 114, SEQ ID
NO: 940, SEQ ID NO: 960, SEQ ID NO: 144, SEQ ID NO: 362, SEQ ID NO: 40, SEQ ID
NO: 285, SEQ ID NO: 11, SEQ ID NO: 161, SEQ ID NO: 974, SEQ ID NO: 111, SEQ ID
NO: 316, SEQ ID NO: 257, SEQ ID NO: 78, and SEQ ID NO: 966.

5. The purified nucleic acid of claim 1, wherein said H. pylori cell envelope
polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a fragment
thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO:
1173, SEQ ID NO: 1405, SEQ ID NO: 1406, SEQ ID NO: 1410, SEQ ID NO: 1086, SEQ
ID NO: 1322, SEQ ID NO: 1266, SEQ ID NO: 1282, SEQ ID NO: 1271, SEQ ID NO:
1208, SEQ ID NO: 1126, SEQ ID NO: 1270, SEQ ID NO: 1278, SEQ ID NO: 1419, SEQ
ID NO: 1125, SEQ ID NO: 1181, SEQ ID NO: 1416, SEQ ID NO: 1096, SEQ ID NO:
1082, SEQ ID NO: 1146, SEQ ID NO: 1145, SEQ ID NO: 1108, SEQ ID NO: 1148, SEQ
ID NO: 1337, SEQ ID NO: 1338, SEQ ID NO: 1424, SEQ ID NO: 1000, SEQ ID NO:
1027, SEQ ID NO: 1175, SEQ ID NO: 1330, SEQ ID NO: 352, SEQ ID NO: 981, SEQ ID
NO: 158, SEQ ID NO: 989, SEQ ID NO: 963, SEQ ID NO: 48, SEQ ID NO: 68, SEQ ID
NO: 135, SEQ ID NO: 910, SEQ ID NO: 236, SEQ ID NO: 241, SEQ ID NO: 949, SEQ
ID NO: 945, SEQ ID NO: 207, and SEQ ID NO: 977.

6. A recombinant expression vector comprising the nucleic acid of claim 1
operably linked to a transcription regulatory element.

7. A cell comprising a recombinant expression vector of claim 6.

8. A method for producing an H. pylori polypeptide comprising culturing a cell
of claim 7 under conditions that permit expression of the polypeptide.

9. An isolated nucleic acid comprising a nucleotide sequence encoding an H.
pylori cytoplasmic polypeptide or a fragment thereof, said nucleic acid selected from the
group consisting of SEQ ID NO: 1147, SEQ ID NO: 1288, SEQ ID NO: 1324, SEQ ID
NO: 1363, SEQ ID NO: 997, SEQ ID NO: 1015, SEQ ID NO: 1084, SEQ ID NO: 1094,
SEQ ID NO: 1099, SEQ ID NO: 1229, SEQ ID NO: 1250, SEQ ID NO: 1268, SEQ ID
NO: 1293, SEQ ID NO: 1339, SEQ ID NO: 1408, SEQ ID NO: 1429, SEQ ID NO: 1434,
SEQ ID NO: 1228, SEQ ID NO: 1031, SEQ ID NO: 1034, SEQ ID NO: 1008, SEQ ID
NO: 1061, SEQ ID NO: 1064, SEQ ID NO: 1191, SEQ ID NO: 1217, SEQ ID NO: 1365,
SEQ ID NO: 1394, SEQ ID NO: 1414, SEQ ID NO: 1415, SEQ ID NO: 1435, SEQ ID
NO: 1058, SEQ ID NO: 1059, SEQ ID NO: 1080, SEQ ID NO: 1128, SEQ ID NO: 1133,
SEQ ID NO: 1211, SEQ ID NO: 1252, SEQ ID NO: 1253, SEQ ID NO: 1286, SEQ ID
NO: 1289, SEQ ID NO: 1291, SEQ ID NO: 1303, SEQ ID NO: 1396, SEQ ID NO: 996,
SEQ ID NO: 1095, SEQ ID NO: 1156, SEQ ID NO: 1158, SEQ ID NO: 1159, SEQ ID
NO: 1277, SEQ ID NO: 1038, SEQ ID NO: 1257, SEQ ID NO: 1357, SEQ ID NO: 1436,
SEQ ID NO: 1047, SEQ ID NO: 1055, SEQ ID NO: SEQ ID NO: 1141, SEQ ID NO:
1227, SEQ ID NO: 1327, SEQ ID NO: 1412, SEQ ID NO: 1003, SEQ ID NO: 1087, SEQ
ID NO: 1116, SEQ ID NO: 1130, SEQ ID NO: 1132, SEQ ID NO: 1185, SEQ ID NO:
1188, SEQ ID NO: 1198, SEQ ID NO: 1218, SEQ ID NO: 1244, SEQ ID NO: 1306, SEQ
ID NO: 1325, SEQ ID NO: 1397, SEQ ID NO: 1398, SEQ ID NO: 1407, SEQ ID NO:
1433, SEQ ID NO: 1216, SEQ ID NO: 1239, SEQ ID NO: 1362, SEQ ID NO: 1017, SEQ
ID NO: 1019, SEQ ID NO: 1360, SEQ ID NO: 1423, SEQ ID NO: 1425, SEQ ID NO:
1374, SEQ ID NO: 1028, SEQ ID NO: 1037, SEQ ID NO: 1077, SEQ ID NO: 1115, SEQ
ID NO: 1232, SEQ ID NO: 1241, SEQ ID NO: 1267, SEQ ID NO: 1163, SEQ ID NO:
1068, SEQ ID NO: 1025, SEQ ID NO: 1042, SEQ ID NO: 1046, SEQ ID NO: 1056, SEQ
ID NO: 1039, SEQ ID NO: 1072, SEQ ID NO: 1073, SEQ ID NO: 1092, SEQ ID NO:
1100, SEQ ID NO: 1102, SEQ ID NO: 1103, SEQ ID NO: 1104, SEQ ID NO: 1111, SEQ
ID NO: 1119, SEQ ID NO: 1136, SEQ ID NO: 1137, SEQ ID NO: 1140, SEQ ID NO:
1142, SEQ ID NO: 1233, SEQ ID NO: 1238, SEQ ID NO: 1243, SEQ ID NO: 1245, SEQ
ID NO: 1247, SEQ ID NO: 1249, SEQ ID NO: 1261, SEQ ID NO: 1269, SEQ ID NO:
1279, SEQ ID NO: 1284, SEQ ID NO: 1290, SEQ ID NO: 1297, SEQ ID NO: 1328, SEQ
ID NO: 1370, SEQ ID NO: 1372, SEQ ID NO: 1377, SEQ ID NO: 1383, SEQ ID NO:
1384, SEQ ID NO: 1385, SEQ ID NO: 1388, SEQ ID NO: 1401, SEQ ID NO: 1402, SEQ
ID NO: 1418, SEQ ID NO: 1420, SEQ ID NO: 1427, SEQ ID NO: 1070, SEQ ID NO:
1151, SEQ ID NO: 1176, SEQ ID NO: 999, SEQ ID NO: 1006, SEQ ID NO: 1012, SEQ
ID NO: 1018, SEQ ID NO: 1030, SEQ ID NO: 1033, SEQ ID NO: 1041, SEQ ID NO:
1049, SEQ ID NO: 1054, SEQ ID NO: 1057, SEQ ID NO: 1090, SEQ ID NO: 1097, SEQ
ID NO: 1129, SEQ ID NO: 1139, SEQ ID NO: 1143, SEQ ID NO: 1152, SEQ ID NO:
1153, SEQ ID NO: 1155, SEQ ID NO: 1161, SEQ ID NO: 1162, SEQ ID NO: 1169, SEQ
ID NO: 1170, SEQ ID NO: 1171, SEQ ID NO: 1180, SEQ ID NO: 1194, SEQ ID NO:
1195, SEQ ID NO: 1199, SEQ ID NO: 1200, SEQ ID NO: 1201, SEQ ID NO: 1202, SEQ
ID NO: 1205, SEQ ID NO: 1312, SEQ ID NO: 1336, SEQ ID NO: 1349, SEQ ID NO:
1355, SEQ ID NO: 1359, SEQ ID NO: 1413, SEQ ID NO: 1426, SEQ ID NO: 1430, SEQ

ID NO: 882, SEQ ID NO: 382, SEQ ID NO: 130, SEQ ID NO: 230, SEQ ID NO: 269,
SEQ ID NO: 312, SEQ ID NO: 211, SEQ ID NO: 959, SEQ ID NO: 938, SEQ ID NO:
110, SEQ ID NO: 244, SEQ ID NO: 328, SEQ ID NO: 235, SEQ ID NO: 315, SEQ ID
NO: 296, SEQ ID NO: 976, SEQ ID NO: 321, SEQ ID NO: 43, SEQ ID NO: 281, SEQ ID
NO: 326, SEQ ID NO: 272, SEQ ID NO: 344, SEQ ID NO: 139, SEQ ID NO: 30, SEQ ID
NO: 220, SEQ ID NO: 364, SEQ ID NO: 369, SEQ ID NO: 372, SEQ ID NO: 991, SEQ
ID NO: 128, SEQ ID NO: 347, SEQ ID NO: 52, SEQ ID NO: 12, SEQ ID NO: 247, SEQ
ID NO: 64, SEQ ID NO: 101, SEQ ID NO: 338, SEQ ID NO: 83, SEQ ID NO: 46, SEQ ID
NO: 348, SEQ ID NO: 223, SEQ ID NO: 39, SEQ ID NO: 232, SEQ ID NO: 168, SEQ ID
NO: 65, SEQ ID NO: 952, SEQ ID NO: 341, SEQ ID NO: 69, SEQ ID NO: 924, SEQ ID
NO: 4, SEQ ID NO: 197, SEQ ID NO: 313, SEQ ID NO: 119, SEQ ID NO: 188, SEQ ID
NO: 956, SEQ ID NO: 935, SEQ ID NO: 246, SEQ ID NO: 196, SEQ ID NO: 376, SEQ
ID NO: 172, SEQ ID NO: 25, SEQ ID NO: 126, SEQ ID NO: 951, SEQ ID NO: 147, SEQ
ID NO: 895, SEQ ID NO: 14, SEQ ID NO: 154, SEQ ID NO: 277, SEQ ID NO: 363, SEQ
ID NO: 342, SEQ ID NO: 378, SEQ ID NO: 130, SEQ ID NO: 198, SEQ ID NO: 243,
SEQ ID NO: 19, SEQ ID NO: 9, SEQ ID NO: 149, SEQ ID NO: 167, SEQ ID NO: 349,
SEQ ID NO: 209, SEQ ID NO: 990, SEQ ID NO: 185, SEQ ID NO: 883, SEQ ID NO: 8,
SEQ ID NO: 887, SEQ ID NO: 350, SEQ ID NO: 987, SEQ ID NO: 63, SEQ ID NO: 249,
SEQ ID NO: 118, SEQ ID NO: 132, SEQ ID NO: 47, SEQ ID NO: 106, SEQ ID NO: 324,
SEQ ID NO: 155, SEQ ID NO: 121, SEQ ID NO: 153, SEQ ID NO: 87, SEQ ID NO: 986,
SEQ ID NO: 262, SEQ ID NO: 333, SEQ ID NO: 36, SEQ ID NO: 982, SEQ ID NO: 180,
SEQ ID NO: 84, SEQ ID NO: 900, SEQ ID NO: 20, SEQ ID NO: 7, SEQ ID NO: 61, SEQ
ID NO: 253, SEQ ID NO: 120, SEQ ID NO: 268, SEQ ID NO: 299, SEQ ID NO: 942,
SEQ ID NO: 173, SEQ ID NO: 187, SEQ ID NO: 187, SEQ ID NO: 234, SEQ ID NO:
112, SEQ ID NO: 324, SEQ ID NO: 971, SEQ ID NO: 62, SEQ ID NO: 308, SEQ ID NO:
74, SEQ ID NO: 1, SEQ ID NO: 266, SEQ ID NO: 337, SEQ ID NO: 93, SEQ ID NO: 44,
SEQ ID NO: 335, SEQ ID NO: 368, SEQ ID NO: 208, SEQ ID NO: 358, SEQ ID NO:
923, SEQ ID NO: 310, SEQ ID NO: 26, SEQ ID NO: 279, SEQ ID NO: 890, SEQ ID NO:
325, SEQ ID NO: 109, SEQ ID NO: 143, SEQ ID NO: 918, SEQ ID NO: 252, SEQ ID
NO: 953, SEQ ID NO: 902, SEQ ID NO: 174, SEQ ID NO: 73, SEQ ID NO: 898, SEQ ID
NO: 300, SEQ ID NO: 356, SEQ ID NO: 298, SEQ ID NO: 354, SEQ ID NO: 138, SEQ
ID NO: 319, SEQ ID NO: 80, SEQ ID NO: 933, SEQ ID NO: 891, SEQ ID NO: 366, SEQ
ID NO: 113, SEQ ID NO: 320, SEQ ID NO: 915, SEQ ID NO: 351, SEQ ID NO: 162,
SEQ ID NO: 965, SEQ ID NO: 67, SEQ ID NO: 314, SEQ ID NO: 904, SEQ ID NO: 345,
SEQ ID NO: 374, SEQ ID NO: 962, SEQ ID NO: 270, SEQ ID NO: 186, SEQ ID NO: 60,
SEQ ID NO: 379, SEQ ID NO: 889, SEQ ID NO: 967, SEQ ID NO: 973, SEQ ID NO:
280, SEQ ID NO: 170, SEQ ID NO: 985, and SEQ ID NO: 932.

10. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in energy conversion encoded by the nucleic acid selected from the group
consisting of SEQ ID NO: 1147, SEQ ID NO: 1288, SEQ ID NO: 1324, SEQ ID NO: 1363,
SEQ ID NO: 882, SEQ ID NO: 382, SEQ ID NO: 130, and SEQ ID NO: 230.

11. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in amino acid metabolism encoded by the nucleic acid selected from the group
consisting of SEQ ID NO: 997, SEQ ID NO: 1015, SEQ ID NO: 1084, SEQ ID NO: 1094,
SEQ ID NO: 1099, SEQ ID NO: 1229, SEQ ID NO: 1250, SEQ ID NO: 1268, SEQ ID
NO: 1293, SEQ ID NO: 1339, SEQ ID NO: 1408, SEQ ID NO: 1429, SEQ ID NO: 1434,
SEQ ID NO: 1228, SEQ ID NO: 1031, SEQ ID NO: 1034, SEQ ID NO: 1008, SEQ ID
NO: 269, SEQ ID NO: 312, SEQ ID NO: 211, SEQ ID NO: 959, SEQ ID NO: 938, SEQ
ID NO: 110, SEQ ID NO: 244, SEQ ID NO: 328, SEQ ID NO: 235, SEQ ID NO: 315,
SEQ ID NO: 296, SEQ ID NO: 976, SEQ ID NO: 321, SEQ ID NO: 43, SEQ ID NO: 281,
SEQ ID NO: 326, and SEQ ID NO: 272.

12. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in nucleotide metabolism encoded by the nucleic acid selected from the group
consisting of SEQ ID NO: 1061, SEQ ID NO: 1064, SEQ ID NO: 1191, SEQ ID NO: 1217,
SEQ ID NO: 1365, SEQ ID NO: 1394, SEQ ID NO: 1414, SEQ ID NO: 1415, SEQ ID
NO: 1435, SEQ ID NO: 1058, SEQ ID NO: 1059, SEQ ID NO: 344, SEQ ID NO: 139,
SEQ ID NO: 30, SEQ ID NO: 220, SEQ ID NO: 364, SEQ ID NO: 369, SEQ ID NO: 372,
SEQ ID NO: 991, SEQ ID NO: 128, SEQ ID NO: 347, and SEQ ID NO: 52.

13. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in carbohydrate metabolism encoded by the nucleic acid selected from the group
consisting of SEQ ID NO: 1080, SEQ ID NO: 1128, SEQ ID NO: 1133, SEQ ID NO: 1211,
SEQ ID NO: 1252, SEQ ID NO: 1253, SEQ ID NO: 1286, SEQ ID NO: 1289, SEQ ID
NO: 1291, SEQ ID NO: 1303, SEQ ID NO: 1396, SEQ ID NO: 996, SEQ ID NO: 12, SEQ
ID NO: 247, SEQ ID NO: 64, SEQ ID NO: 101, SEQ ID NO: 338, SEQ ID NO: 83, SEQ
ID NO: 46, SEQ ID NO: 348, SEQ ID NO: 223, SEQ ID NO: 39, SEQ ID NO: 232, and
SEQ ID NO: 168.

14. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in cofactor metabolism encoded by the nucleic acid selected from the group
consisting of SEQ ID NO: 1095, SEQ ID NO: 1156, SEQ ID NO: 1158, SEQ ID NO: 1159,
SEQ ID NO: 1277, SEQ ID NO: 1038, SEQ ID NO: 65, SEQ ID NO: 952, SEQ ID NO:
341, SEQ ID NO: 69, SEQ ID NO: 924, and SEQ ID NO: 4.

15. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in lipid metabolism encoded by the nucleic acid selected from the group
consisting of SEQ ID NO: 1257, SEQ ID NO: 1357, SEQ ID NO: 1436, SEQ ID NO: 197,
SEQ ID NO: 313, and SEQ ID NO: 119.

16. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in mRNA translation and ribosome biogenesis encoded by the nucleic acid
selected from the group consisting of SEQ ID NO: 1047, SEQ ID NO: 1055, SEQ ID NO:
SEQ ID NO: 1141, SEQ ID NO: 1227, SEQ ID NO: 1327, SEQ ID NO: 1412, SEQ ID
NO: 188, SEQ ID NO: 956, SEQ ID NO: 935, SEQ ID NO: 246, SEQ ID NO: 196, and
SEQ ID NO: 376.

17. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in genome replication, transcription, recombination and repair encoded by the
nucleic acid selected from the group consisting of SEQ ID NO: 1003, SEQ ID NO: 1087,
SEQ ID NO: 1116, SEQ ID NO: 1130, SEQ ID NO: 1132, SEQ ID NO: 1185, SEQ ID
NO: 1188, SEQ ID NO: 1198, SEQ ID NO: 1218, SEQ ID NO: 1244, SEQ ID NO: 1306,
SEQ ID NO: 1325, SEQ ID NO: 1397, SEQ ID NO: 1398, SEQ ID NO: 1407, SEQ ID
NO: 1433, SEQ ID NO: 172, SEQ ID NO: 25, SEQ ID NO: 126, SEQ ID NO: 951, SEQ
ID NO: 147, SEQ ID NO: 895, SEQ ID NO: 14, SEQ ID NO: 154, SEQ ID NO: 277, SEQ
ID NO: 363, SEQ ID NO: 342, SEQ ID NO: 378, SEQ ID NO: 130, SEQ ID NO: 198,
SEQ ID NO: 243, SEQ ID NO: 19, and SEQ ID NO: 9.

18. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in outer membrane or cell wall biosynthesis encoded by the nucleic acid selected
from the group consisting of SEQ ID NO: 1216, SEQ ID NO: 1239, SEQ ID NO: 1362,
SEQ ID NO: 1017, SEQ ID NO: 1019, SEQ ID NO: 1360, SEQ ID NO: 149, SEQ ID NO:
167, SEQ ID NO: 349, SEQ ID NO: 209, SEQ ID NO: 990, SEQ ID NO: 185, SEQ ID
NO: 883, and SEQ ID NO: 8.

19. The purified nucleic acid of claim 9, wherein said H. pylori cytoplasmic
polypeptide is an H. pylori chaperone polypeptide or a fragment thereof encoded by the
nucleic acid selected from the group consisting of
SEQ ID NO: 1423, SEQ ID NO: 1425, SEQ ID NO: 1374, SEQ ID NO: 887, SEQ ID NO:
350, and SEQ ID NO: 987.

20. A recombinant expression vector comprising the nucleic acid of claim 9
operably linked to a transcription regulatory element.

21. A cell comprising a recombinant expression vector of claim 20. 

22. A method for producing an H. pylori polypeptide comprising culturing a cell 
of claim 21 under conditions that permit expression of the polypeptide. 

23. An isolated nucleic acid comprising a nucleotide sequence encoding an H.
pylori secreted or periplasmic polypeptide or a fragment thereof, said nucleic acid selected
from the group consisting of SEQ ID NO: 1004, SEQ ID NO: 1138, SEQ ID NO: 1067,
SEQ ID NO: 1078, SEQ ID NO: 1314, SEQ ID NO: 1319, SEQ ID NO: 1378, SEQ ID
NO: 1105, SEQ ID NO: 1114, SEQ ID NO: 1118, SEQ ID NO: 1120, SEQ ID NO: 1123,
SEQ ID NO: 1127, SEQ ID NO: 1212, SEQ ID NO: 1223, SEQ ID NO: 1225, SEQ ID
NO: 1246, SEQ ID NO: 1248, SEQ ID NO: 1259, SEQ ID NO: 1264, SEQ ID NO: 1265,
SEQ ID NO: 1281, SEQ ID NO: 1285, SEQ ID NO: 1294, SEQ ID NO: 1298, SEQ ID
NO: 1299, SEQ ID NO: 1315, SEQ ID NO: 1316, SEQ ID NO: 1317, SEQ ID NO: 1318,
SEQ ID NO: 1344, SEQ ID NO: 1351, SEQ ID NO: 1353, SEQ ID NO: 1373, SEQ ID
NO: 1380, SEQ ID NO: 1387, SEQ ID NO: 1389, SEQ ID NO: 1393, SEQ ID NO: 1411,
SEQ ID NO: 1428, SEQ ID NO: 1431, SEQ ID NO: 1439, SEQ ID NO: 1043, SEQ ID
NO: 1183, SEQ ID NO: 1184, SEQ ID NO: 1196, SEQ ID NO: 1197, SEQ ID NO: 1203,
SEQ ID NO: 995, SEQ ID NO: 998, SEQ ID NO: 1001, SEQ ID NO: 1022, SEQ ID NO:
1023, SEQ ID NO: 1029, SEQ ID NO: 1040, SEQ ID NO: 1051, SEQ ID NO: 1062, SEQ
ID NO: 1154, SEQ ID NO: 1320, SEQ ID NO: 1075, SEQ ID NO: 1106, SEQ ID NO:
1109, SEQ ID NO: 1134, SEQ ID NO: 1221, SEQ ID NO: 1226, SEQ ID NO: 1235, SEQ
ID NO: 1301, SEQ ID NO: 1311, SEQ ID NO: 1326, SEQ ID NO: 1341, SEQ ID NO:
1354, SEQ ID NO: 1364, SEQ ID NO: 1366, SEQ ID NO: 1376, SEQ ID NO: 1391, SEQ

ID NO: 1395, SEQ ID NO: 1445, SEQ ID NO: 1079, SEQ ID NO: 1186, SEQ ID NO:
1010, SEQ ID NO: 1016, SEQ ID NO: 1172, SEQ ID NO: 1174, SEQ ID NO: 117, SEQ
ID NO: 254, SEQ ID NO: 24, SEQ ID NO: 242, SEQ ID NO: 950, SEQ ID NO: 263, SEQ
ID NO: 286, SEQ ID NO: 947, SEQ ID NO: 51, SEQ ID NO: 177, SEQ ID NO: 156, SEQ
ID NO: 190, SEQ ID NO: 375, SEQ ID NO: 222, SEQ ID NO: 21, SEQ ID NO: 912, SEQ
ID NO: 148, SEQ ID NO: 202, SEQ ID NO: 224, SEQ ID NO: 112, SEQ ID NO: 32, SEQ
ID NO: 339, SEQ ID NO: 182, SEQ ID NO: 228, SEQ ID NO: 152, SEQ ID NO: 219,
SEQ ID NO: 137, SEQ ID NO: 318, SEQ ID NO: 141, SEQ ID NO: 165, SEQ ID NO:
334, SEQ ID NO: 13, SEQ ID NO: 297, SEQ ID NO: 35, SEQ ID NO: 216, SEQ ID NO:
908, SEQ ID NO: 124, SEQ ID NO: 75, SEQ ID NO: 927, SEQ ID NO: 221, SEQ ID NO:
178, SEQ ID NO: 169, SEQ ID NO: 293, SEQ ID NO: 289, SEQ ID NO: 926, SEQ ID
NO: 948, SEQ ID NO: 115, SEQ ID NO: 251, SEQ ID NO: 345, SEQ ID NO: 17, SEQ ID
NO: 920, SEQ ID NO: 95, SEQ ID NO: 86, SEQ ID NO: 360, SEQ ID NO: 271, SEQ ID
NO: 970, SEQ ID NO: 288, SEQ ID NO: 282, SEQ ID NO: 98, SEQ ID NO: 29, SEQ ID
NO: 317, SEQ ID NO: 343, SEQ ID NO: 291, SEQ ID NO: 108, SEQ ID NO: 377, SEQ
ID NO: 305, SEQ ID NO: 305, SEQ ID NO: 100, SEQ ID NO: 988, SEQ ID NO: 212,
SEQ ID NO: 884, SEQ ID NO: 37, SEQ ID NO: 968, SEQ ID NO: 975, SEQ ID NO: 237,
SEQ ID NO: 335, SEQ ID NO: 260, SEQ ID NO: 370, SEQ ID NO: 91, SEQ ID NO: 276,
SEQ ID NO: 311, SEQ ID NO: 173, SEQ ID NO: 102, SEQ ID NO: 304, SEQ ID NO:
380, SEQ ID NO: 127, SEQ ID NO: 993, SEQ ID NO: 925, SEQ ID NO: 181, and SEQ ID
NO: 171.

24. A recombinant expression vector comprising the nucleic acid of claim 23 
operably linked to a transcription regulatory element. 

25. A cell comprising a recombinant expression vector of claim 24.



26. A method for producing an H. pylori polypeptide comprising culturing a cell
of claim 25 under conditions that permit expression of the polypeptide.

27. An isolated nucleic acid comprising a nucleotide sequence encoding an H.
pylori surface or membrane polypeptide or a fragment thereof, said nucleic acid selected
from the group consisting of SEQ ID NO: 1060, SEQ ID NO: 1110, SEQ ID NO: 1112,
SEQ ID NO: 1230, SEQ ID NO: 1260, SEQ ID NO: 1280, SEQ ID NO: 1292, SEQ ID
NO: 1296, SEQ ID NO: 1307, SEQ ID NO: 1442, SEQ ID NO: 1444, SEQ ID NO: 1122,
SEQ ID NO: 1254, SEQ ID NO: 1256, SEQ ID NO: 1272, SEQ ID NO: 1275, SEQ ID
NO: 1309, SEQ ID NO: 1313, SEQ ID NO: 1347, SEQ ID NO: 1352, SEQ ID NO: 1356,
SEQ ID NO: 1438, SEQ ID NO: 1441, SEQ ID NO: 1009, SEQ ID NO: 1026, SEQ ID
NO: 1048, SEQ ID NO: 1063, SEQ ID NO: 1190, SEQ ID NO: 1083, SEQ ID NO: 1113,
SEQ ID NO: 1222, SEQ ID NO: 1295, SEQ ID NO: 1343, SEQ ID NO: 1392, SEQ ID
NO: 1443, SEQ ID NO: 1085, SEQ ID NO: 1093, SEQ ID NO: 1117, SEQ ID NO: 1121,
SEQ ID NO: 1131, SEQ ID NO: 1287, SEQ ID NO: 1440, SEQ ID NO: 1209, SEQ ID
NO: 1342, SEQ ID NO: 1381, SEQ ID NO: 1390, SEQ ID NO: 1409, SEQ ID NO: 1035,
SEQ ID NO: 1014, SEQ ID NO: 1088, SEQ ID NO: 1242, SEQ ID NO: 1178, SEQ ID
NO: 1089, SEQ ID NO: 1340, SEQ ID NO: 1074, SEQ ID NO: 1107, SEQ ID NO: 1204,
SEQ ID NO: 1066, SEQ ID NO: 381, SEQ ID NO: 229, SEQ ID NO: 323, SEQ ID NO:
371, SEQ ID NO: 284, SEQ ID NO: 116, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO:
907, SEQ ID NO: 193, SEQ ID NO: 145, SEQ ID NO: 59, SEQ ID NO: 322, SEQ ID NO:
94, SEQ ID NO: 306, SEQ ID NO: 939, SEQ ID NO: 205, SEQ ID NO: 123, SEQ ID NO:
906, SEQ ID NO: 928, SEQ ID NO: 346, SEQ ID NO: 129, SEQ ID NO: 307, SEQ ID
NO: 133, SEQ ID NO: 131, SEQ ID NO: 886, SEQ ID NO: 179, SEQ ID NO: 104, SEQ
ID NO: 213, SEQ ID NO: 359, SEQ ID NO: 140, SEQ ID NO: 146, SEQ ID NO: 327,
SEQ ID NO: 365, SEQ ID NO: 33, SEQ ID NO: 331, SEQ ID NO: 175, SEQ ID NO: 200,
SEQ ID NO: 292, SEQ ID NO: 23, SEQ ID NO: 336, SEQ ID NO: 301, SEQ ID NO: 28,
SEQ ID NO: 941, SEQ ID NO: 103, SEQ ID NO: 231, SEQ ID NO: 176, SEQ ID NO: 31,
SEQ ID NO: 917, SEQ ID NO: 151, SEQ ID NO: 922, SEQ ID NO: 265, SEQ ID NO:
142, SEQ ID NO: 259, SEQ ID NO: 122, SEQ ID NO: 206, SEQ ID NO: 96, SEQ ID NO:
353, SEQ ID NO: 38, SEQ ID NO: 89, SEQ ID NO: 77, SEQ ID NO: 954, SEQ ID NO:
264, SEQ ID NO: 937, SEQ ID NO: 226, SEQ ID NO: 283, SEQ ID NO: 88, SEQ ID NO:
125, SEQ ID NO: 183, SEQ ID NO: 195, SEQ ID NO: 81, SEQ ID NO: 901, SEQ ID NO:
82, SEQ ID NO: 42, SEQ ID NO: 881, and SEQ ID NO: 885.

28. The purified nucleic acid of claim 27, wherein said H. pylori surface or 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment 
thereof having at least one membrane spanning region encoded by the nucleic acid selected
from the group consisting of SEQ ID NO: 1060, SEQ ID NO: 1110, SEQ ID NO: 1112, 
SEQ ID NO: 1230, SEQ ID NO: 1260, SEQ ID NO: 1280, SEQ ID NO: 1292, SEQ ID 
NO: 1296, SEQ ID NO: 1307, SEQ ID NO: 1442, SEQ ID NO: 1444, SEQ ID NO: 381, 
SEQ ID NO: 229, SEQ ID NO: 323, SEQ ID NO: 371, SEQ ID NO: 284, SEQ ID NO:
116, SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 907, SEQ ID NO: 193, SEQ ID NO:
145, SEQ ID NO: 59, SEQ ID NO: 322, SEQ ID NO: 94, SEQ ID NO: 306, and SEQ ID 
NO: 881. 

29. The purified nucleic acid of claim 27, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least two membrane spanning regions encoded by the nucleic acid
selected from the group consisting of SEQ ID NO: 1122, SEQ ID NO: 1254, SEQ ID NO:
1256, SEQ ID NO: 1272, SEQ ID NO: 1275, SEQ ID NO: 1309, SEQ ID NO: 1313, SEQ
ID NO: 1347, SEQ ID NO: 1352, SEQ ID NO: 1356, SEQ ID NO: 1438, SEQ ID NO:
1441, SEQ ID NO: 1009, SEQ ID NO: 1026, SEQ ID NO: 1048, SEQ ID NO: 1063, SEQ
ID NO: 1190, SEQ ID NO: 939, SEQ ID NO: 205, SEQ ID NO: 123, SEQ ID NO: 906,
SEQ ID NO: 928, SEQ ID NO: 346, SEQ ID NO: 129, SEQ ID NO: 307, SEQ ID NO:
133, SEQ ID NO: 131, SEQ ID NO: 886, SEQ ID NO: 179, SEQ ID NO: 104, SEQ ID
NO: 213, SEQ ID NO: 359, SEQ ID NO: 140, SEQ ID NO: 146, SEQ ID NO: 327, and
SEQ ID NO: 365.

30. The purified nucleic acid of claim 27, wherein said H. pylori surface or 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least three membrane spanning regions encoded by the nucleic acid
selected from the group consisting of SEQ ID NO: 1083, SEQ ID NO: 1113, SEQ ID NO:
1222, SEQ ID NO: 1295, SEQ ID NO: 1343, SEQ ID NO: 1392, SEQ ID NO: 1443, SEQ
ID NO: 33, SEQ ID NO: 331, SEQ ID NO: 175, SEQ ID NO: 200, SEQ ID NO: 292, SEQ
ID NO: 23, and SEQ ID NO: 336.


31. The purified nucleic acid of claim 27, wherein said H. pylori surface or 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment 
thereof having at least four membrane spanning regions encoded by the nucleic acid 
selected from the group consisting of SEQ ID NO: 1085, SEQ ID NO: 1093, SEQ ID NO:
1117, SEQ ID NO: 1121, SEQ ID NO: 1131, SEQ ID NO: 1287, SEQ ID NO: 1440, SEQ
ID NO: 1209, SEQ ID NO: 301, SEQ ID NO: 28, SEQ ID NO: 941, SEQ ID NO: 103,
SEQ ID NO: 231, SEQ ID NO: 176, SEQ ID NO: 31, SEQ ID NO: 917, SEQ ID NO: 151,
and SEQ ID NO: 922. 

32. The purified nucleic acid of claim 27, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least five membrane spanning regions encoded by the nucleic acid
selected from the group consisting of SEQ ID NO: 1342, SEQ ID NO: 1381, SEQ ID NO:
1390, SEQ ID NO: 1409, SEQ ID NO: 1035, SEQ ID NO: 265, SEQ ID NO: 142, SEQ ID
NO: 259, SEQ ID NO: 122, SEQ ID NO: 206, and SEQ ID NO: 885.

33. The purified nucleic acid of claim 27, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least six membrane spanning regions encoded by the nucleic acid selected
from the group consisting of SEQ ID NO: 1014, SEQ ID NO: 1088, SEQ ID NO: 1242,
SEQ ID NO: 1178, SEQ ID NO: 96, SEQ ID NO: 353, SEQ ID NO: 38, SEQ ID NO: 89,
SEQ ID NO: 77, SEQ ID NO: 954, and SEQ ID NO: 264.

34. The purified nucleic acid of claim 27, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least seven membrane spanning regions encoded by the nucleic acid
selected from the group consisting of SEQ ID NO: 1089, SEQ ID NO: 1340, SEQ ID NO:
1074, SEQ ID NO: 1107, SEQ ID NO: 1204, SEQ ID NO: 1066, SEQ ID NO: 937, SEQ
ID NO: 226, SEQ ID NO: 283, SEQ ID NO: 88, SEQ ID NO: 125, SEQ ID NO: 183, SEQ
ID NO: 195, SEQ ID NO: 81, SEQ ID NO: 901, SEQ ID NO: 82, and SEQ ID NO: 42.

35. A recombinant expression vector comprising the nucleic acid of claim 27 
operably linked to a transcription regulatory element. 

36. A cell comprising a recombinant expression vector of claim 35.

37. A method for producing an H. pylori polypeptide comprising culturing a cell
of claim 36 under conditions that permit expression of the polypeptide. 

38. A vaccine composition for prevention or treatment of an H. pylori infection
comprising an effective amount of a nucleic acid of claim 1.

39. A vaccine composition of claim 38, further comprising a pharmaceutically
acceptable carrier.

40. A vaccine composition of claim 39, wherein the pharmaceutically acceptable
carrier is an adjuvant.

41. A method of treating a subject for H. pylori infection comprising
administering to a subject a vaccine composition of claim 38, such that treatment of H.
pylori infection occurs.

42. A method of claim 41, wherein the treatment is a prophylactic treatment.

43. A method of claim 41, wherein the treatment is a therapeutic treatment.

44. A vaccine composition for prevention or treatment of an H. pylori infection 
comprising an effective amount of a nucleic acid of claim 9. 

45. A vaccine composition of claim 44, further comprising a pharmaceutically
acceptable carrier.

46. A vaccine composition of claim 45, wherein the pharmaceutically acceptable
carrier is an adjuvant.

47. A method of treating a subject for H. pylori infection comprising
administering to a subject a vaccine composition of claim 44, such that treatment of H.
pylori infection occurs.

48. A method of claim 47, wherein the treatment is a prophylactic treatment.

49. A method of claim 47, wherein the treatment is a therapeutic treatment.

50. A vaccine composition for prevention or treatment of an H. pylori infection
comprising an effective amount of a nucleic acid of claim 23.

51. A vaccine composition of claim 50, further comprising a pharmaceutically
acceptable carrier.

52. A vaccine composition of claim 51, wherein the pharmaceutically acceptable
carrier is an adjuvant.

53. A method of treating a subject for H. pylori infection comprising
administering to a subject a vaccine composition of claim 50, such that treatment of H.
pylori infection occurs.

54. A method of claim 53, wherein the treatment is a prophylactic treatment.

55. A method of claim 53, wherein the treatment is a therapeutic treatment.

56. A purified H. pylori cell envelope polypeptide or a fragment thereof,
wherein said polypeptide is selected from the group consisting of SEQ ID NO: 1471, SEQ
ID NO: 1472, SEQ ID NO: 1487, SEQ ID NO: 1501, SEQ ID NO: 1522, SEQ ID NO:
1552, SEQ ID NO: 1586, SEQ ID NO: 1727, SEQ ID NO: 1601, SEQ ID NO: 1638, SEQ
ID NO: 1643, SEQ ID NO: 1812, SEQ ID NO: 1830, SEQ ID NO: 1850, SEQ ID NO:
1854, SEQ ID NO: 1851, SEQ ID NO: 1640, SEQ ID NO: 1453, SEQ ID NO: 1664, SEQ
ID NO: 1665, SEQ ID NO: 1666, SEQ ID NO: 1685, SEQ ID NO: 1687, SEQ ID NO:
1688, SEQ ID NO: 1675, SEQ ID NO: 1702, SEQ ID NO: 1713, SEQ ID NO: 1600, SEQ
ID NO: 1671, SEQ ID NO: 1691, SEQ ID NO: 1615, SEQ ID NO: 1616, SEQ ID NO:
1855, SEQ ID NO: 1595, SEQ ID NO: 1633, SEQ ID NO: 1608, SEQ ID NO: 1611, SEQ
ID NO: 1751, SEQ ID NO: 1772, SEQ ID NO: 1774, SEQ ID NO: 1780, SEQ ID NO:
1783, SEQ ID NO: 1796, SEQ ID NO: 1809, SEQ ID NO: 1826, SEQ ID NO: 1868, SEQ
ID NO: 1734, SEQ ID NO: 1786, SEQ ID NO: 1819, SEQ ID NO: 1630, SEQ ID NO:
1706, SEQ ID NO: 1709, SEQ ID NO: 1495, SEQ ID NO: 1724, SEQ ID NO: 1670, SEQ
ID NO: 1725, SEQ ID NO: 1661, SEQ ID NO: 1873, SEQ ID NO: 1753, SEQ ID NO:
1759, SEQ ID NO: 1761, SEQ ID NO: 1782, SEQ ID NO: 1883, SEQ ID NO: 1503, SEQ
ID NO: 1542, SEQ ID NO: 1872, SEQ ID NO: 1520, SEQ ID NO: 1456, SEQ ID NO:
1458, SEQ ID NO: 1617, SEQ ID NO: 1628, SEQ ID NO: 1644, SEQ ID NO: 1657, SEQ
ID NO: 1658, SEQ ID NO: 1755, SEQ ID NO: 1756, SEQ ID NO: 1797, SEQ ID NO:
1799, SEQ ID NO: 1801, SEQ ID NO: 1483, SEQ ID NO: 1504, SEQ ID NO: 1532, SEQ
ID NO: 1575, SEQ ID NO: 1833, SEQ ID NO: 1888, SEQ ID NO: 1714, SEQ ID NO:
1624, SEQ ID NO: 1856, SEQ ID NO: 1857, SEQ ID NO: 1861, SEQ ID NO: 1537, SEQ
ID NO: 1773, SEQ ID NO: 1717, SEQ ID NO: 1733, SEQ ID NO: 1722, SEQ ID NO:
1659, SEQ ID NO: 1577, SEQ ID NO: 1721, SEQ ID NO: 1729, SEQ ID NO: 1870, SEQ
ID NO: 1576, SEQ ID NO: 1632, SEQ ID NO: 1867, SEQ ID NO: 1547, SEQ ID NO:
1533, SEQ ID NO: 1597, SEQ ID NO: 1596, SEQ ID NO: 1559, SEQ ID NO: 1599, SEQ
ID NO: 1788, SEQ ID NO: 1789, SEQ ID NO: 1875, SEQ ID NO: 1451, SEQ ID NO:
1478, SEQ ID NO: 1626, SEQ ID NO: 1781, SEQ ID NO: 660, SEQ ID NO: 660, SEQ ID
NO: 855, SEQ ID NO: 534, SEQ ID NO: 675, SEQ ID NO: 404, SEQ ID NO: 518, SEQ
ID NO: 464, SEQ ID NO: 672, SEQ ID NO: 640, SEQ ID NO: 490, SEQ ID NO: 755,
SEQ ID NO: 389, SEQ ID NO: 635, SEQ ID NO: 877, SEQ ID NO: 637, SEQ ID NO:
477, SEQ ID NO: 772, SEQ ID NO: 658, SEQ ID NO: 463, SEQ ID NO: 852, SEQ ID
NO: 503, SEQ ID NO: 411, SEQ ID NO: 441, SEQ ID NO: 782, SEQ ID NO: 575, SEQ
ID NO: 691, SEQ ID NO: 724, SEQ ID NO: 452, SEQ ID NO: 386, SEQ ID NO: 497,
SEQ ID NO: 712, SEQ ID NO: 591, SEQ ID NO: 638, SEQ ID NO: 740, SEQ ID NO:
697, SEQ ID NO: 569, SEQ ID NO: 470, SEQ ID NO: 700, SEQ ID NO: 586, SEQ ID
NO: 823, SEQ ID NO: 627, SEQ ID NO: 627, SEQ ID NO: 684, SEQ ID NO: 551, SEQ
ID NO: 478, SEQ ID NO: 508, SEQ ID NO: 545, SEQ ID NO: 628, SEQ ID NO: 443,
SEQ ID NO: 702, SEQ ID NO: 776, SEQ ID NO: 461, SEQ ID NO: 737, SEQ ID NO:
809, SEQ ID NO: 642, SEQ ID NO: 879, SEQ ID NO: 773, SEQ ID NO: 468, SEQ ID
NO: 842, SEQ ID NO: 788, SEQ ID NO: 624, SEQ ID NO: 788, SEQ ID NO: 644, SEQ
ID NO: 727, SEQ ID NO: 631, SEQ ID NO: 450, SEQ ID NO: 448, SEQ ID NO: 653,
SEQ ID NO: 495, SEQ ID NO: 400, SEQ ID NO: 541, SEQ ID NO: 673, SEQ ID NO:
482, SEQ ID NO: 622, SEQ ID NO: 689, SEQ ID NO: 736, SEQ ID NO: 417, SEQ ID
NO: 716, SEQ ID NO: 762, SEQ ID NO: 395, SEQ ID NO: 587, SEQ ID NO: 669, SEQ
ID NO: 758, SEQ ID NO: 593, SEQ ID NO: 451, SEQ ID NO: 827, SEQ ID NO: 502,
SEQ ID NO: 719, SEQ ID NO: 469, SEQ ID NO: 715, SEQ ID NO: 847, SEQ ID NO:
453, SEQ ID NO: 527, SEQ ID NO: 652, SEQ ID NO: 745, SEQ ID NO: 567, SEQ ID
NO: 848, SEQ ID NO: 430, SEQ ID NO: 748, SEQ ID NO: 396, SEQ ID NO: 588, SEQ
ID NO: 795, SEQ ID NO: 523, SEQ ID NO: 791, SEQ ID NO: 714, SEQ ID NO: 481,
SEQ ID NO: 765, SEQ ID NO: 837, SEQ ID NO: 833, SEQ ID NO: 585, SEQ ID NO:
865, SEQ ID NO: 764, SEQ ID NO: 440, SEQ ID NO: 465, SEQ ID NO: 555, SEQ ID
NO: 526, SEQ ID NO: 687, SEQ ID NO: 692, SEQ ID NO: 693, SEQ ID NO: 677, SEQ
ID NO: 649, SEQ ID NO: 812, SEQ ID NO: 820, SEQ ID NO: 880, SEQ ID NO: 590,
SEQ ID NO: 713, SEQ ID NO: 750, SEQ ID NO: 613, SEQ ID NO: 437, SEQ ID NO:
556, SEQ ID NO: 657, SEQ ID NO: 402, SEQ ID NO: 623, SEQ ID NO: 862, SEQ ID
NO: 449, SEQ ID NO: 690, SEQ ID NO: 424, SEQ ID NO: 821, SEQ ID NO: 432, SEQ
ID NO: 811, SEQ ID NO: 554, and SEQ ID NO: 809.


57. The purified polypeptide of claim 56, wherein said H. pylori cell envelope
polypeptide or a fragment thereof is an H. pylori flagella-associated polypeptide or a
fragment thereof selected from the group consisting of SEQ ID NO: 1471, SEQ ID NO:
1472, SEQ ID NO: 1487, SEQ ID NO: 1501, SEQ ID NO: 1522, SEQ ID NO: 1552, SEQ
ID NO: 1586, SEQ ID NO: 1727, SEQ ID NO: 1601, SEQ ID NO: 1638, SEQ ID NO:
1643, SEQ ID NO: 1812, SEQ ID NO: 1830, SEQ ID NO: 1850, SEQ ID NO: 1854, SEQ
ID NO: 1851, SEQ ID NO: 1640, SEQ ID NO: 660, SEQ ID NO: 660, SEQ ID NO: 855,
SEQ ID NO: 534, SEQ ID NO: 675, SEQ ID NO: 404, SEQ ID NO: 518, SEQ ID NO:
464, SEQ ID NO: 672, SEQ ID NO: 640, SEQ ID NO: 490, SEQ ID NO: 755, SEQ ID
NO: 389, SEQ ID NO: 635, SEQ ID NO: 877, SEQ ID NO: 637, SEQ ID NO: 477, SEQ
ID NO: 772, and SEQ ID NO: 658.

58. The purified polypeptide of claim 56, wherein said H. pylori cell envelope
polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide or a fragment
thereof selected from the group consisting of SEQ ID NO: 1453, SEQ ID NO: 1664, SEQ
ID NO: 1665, SEQ ID NO: 1666, SEQ ID NO: 1685, SEQ ID NO: 1687, SEQ ID NO:
1688, SEQ ID NO: 1675, SEQ ID NO: 1702, SEQ ID NO: 1713, SEQ ID NO: 1600, SEQ
ID NO: 1671, SEQ ID NO: 1691, SEQ ID NO: 1615, SEQ ID NO: 1616, SEQ ID NO:
1855, SEQ ID NO: 1595, SEQ ID NO: 1633, SEQ ID NO: 1608, SEQ ID NO: 1611, SEQ
ID NO: 1751, SEQ ID NO: 1772, SEQ ID NO: 1774, SEQ ID NO: 1780, SEQ ID NO:
1783, SEQ ID NO: 1796, SEQ ID NO: 1809, SEQ ID NO: 1826, SEQ ID NO: 1868, SEQ
ID NO: 1734, SEQ ID NO: 1786, SEQ ID NO: 1819, SEQ ID NO: 1630, SEQ ID NO:
1706, SEQ ID NO: 1709, SEQ ID NO: 1495, SEQ ID NO: 1724, SEQ ID NO: 463, SEQ
ID NO: 852, SEQ ID NO: 503, SEQ ID NO: 411, SEQ ID NO: 441, SEQ ID NO: 782,
SEQ ID NO: 575, SEQ ID NO: 691, SEQ ID NO: 724, SEQ ID NO: 452, SEQ ID NO:
386, SEQ ID NO: 497, SEQ ID NO: 712, SEQ ID NO: 591, SEQ ID NO: 638, SEQ ID
NO: 740, SEQ ID NO: 697, SEQ ID NO: 569, SEQ ID NO: 470, SEQ ID NO: 700, SEQ
ID NO: 586, SEQ ID NO: 823, SEQ ID NO: 627, SEQ ID NO: 627, SEQ ID NO: 684,
SEQ ID NO: 551, SEQ ID NO: 478, SEQ ID NO: 508, SEQ ID NO: 545, SEQ ID NO:
628, SEQ ID NO: 443, SEQ ID NO: 702, SEQ ID NO: 776, SEQ ID NO: 461, SEQ ID
NO: 737, SEQ ID NO: 809, SEQ ID NO: 642, SEQ ID NO: 879, SEQ ID NO: 773, SEQ
ID NO: 468, SEQ ID NO: 842, SEQ ID NO: 788, SEQ ID NO: 624, SEQ ID NO: 788,
SEQ ID NO: 644, SEQ ID NO: 727, SEQ ID NO: 631, SEQ ID NO: 450, SEQ ID NO:
448, and SEQ ID NO: 653.

59. The purified polypeptide of claim 56, wherein said H. pylori cell envelope 
polypeptide or a fragment thereof is an H. pylori transporter polypeptide or a fragment 
thereof selected from the group consisting of SEQ ID NO: 1670, SEQ ID NO: 1725, SEQ
ID NO: 1661, SEQ ID NO: 1873, SEQ ID NO: 1753, SEQ ID NO: 1759, SEQ ID NO:
1761, SEQ ID NO: 1782, SEQ ID NO: 1883, SEQ ID NO: 1503, SEQ ID NO: 1542, SEQ
ID NO: 1872, SEQ ID NO: 1520, SEQ ID NO: 1456, SEQ ID NO: 1458, SEQ ID NO:
1617, SEQ ID NO: 1628, SEQ ID NO: 1644, SEQ ID NO: 1657, SEQ ID NO: 1658, SEQ
ID NO: 1755, SEQ ID NO: 1756, SEQ ID NO: 1797, SEQ ID NO: 1799, SEQ ID NO:
1801, SEQ ID NO: 1483, SEQ ID NO: 1504, SEQ ID NO: 1532, SEQ ID NO: 1575, SEQ
ID NO: 1833, SEQ ID NO: 1888, SEQ ID NO: 1714, SEQ ID NO: 495, SEQ ID NO: 400,
SEQ ID NO: 541, SEQ ID NO: 673, SEQ ID NO: 482, SEQ ID NO: 622, SEQ ID NO:
689, SEQ ID NO: 736, SEQ ID NO: 417, SEQ ID NO: 716, SEQ ID NO: 762, SEQ ID
NO: 395, SEQ ID NO: 587, SEQ ID NO: 669, SEQ ID NO: 758, SEQ ID NO: 593, SEQ
ID NO: 451, SEQ ID NO: 827, SEQ ID NO: 502, SEQ ID NO: 719, SEQ ID NO: 469,
SEQ ID NO: 715, SEQ ID NO: 847, SEQ ID NO: 453, SEQ ID NO: 527, SEQ ID NO: 
652, SEQ ID NO: 745, SEQ ID NO: 567, SEQ ID NO: 848, SEQ ID NO: 430, SEQ ID 
NO: 748, SEQ ID NO: 396, SEQ ID NO: 588, SEQ ID NO: 795, SEQ ID NO: 523, SEQ 
ID NO: 791, SEQ ID NO: 714, SEQ ID NO: 481, and SEQ ID NO: 765. 


60. The purified polypeptide of claim 56, wherein said H. pylori cell envelope 
polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a fragment 
thereof selected from the group consisting of SEQ ID NO: 1624, SEQ ID NO: 1856, SEQ 
ID NO: 1857, SEQ ID NO: 1861, SEQ ID NO: 1537, SEQ ID NO: 1773, SEQ ID NO:
1717, SEQ ID NO: 1733, SEQ ID NO: 1722, SEQ ID NO: 1659, SEQ ID NO: 1577, SEQ
ID NO: 1721, SEQ ID NO: 1729, SEQ ID NO: 1870, SEQ ID NO: 1576, SEQ ID NO:
1632, SEQ ID NO: 1867, SEQ ID NO: 1547, SEQ ID NO: 1533, SEQ ID NO: 1597, SEQ
ID NO: 1596, SEQ ID NO: 1559, SEQ ID NO: 1599, SEQ ID NO: 1788, SEQ ID NO:
1789, SEQ ID NO: 1875, SEQ ID NO: 1451, SEQ ID NO: 1478, SEQ ID NO: 1626, SEQ
ID NO: 1781, SEQ ID NO: 837, SEQ ID NO: 833, SEQ ID NO: 585, SEQ ID NO: 865, 
SEQ ID NO: 764, SEQ ID NO: 440, SEQ ID NO: 465, SEQ ID NO: 555, SEQ ID NO: 
526, SEQ ID NO: 687, SEQ ID NO: 692, SEQ ID NO: 693, SEQ ID NO: 677, SEQ ID 
NO: 649, and SEQ ID NO: 812. 


61. A purified H. pylori cytoplasmic polypeptide or a fragment thereof, wherein 
said polypeptide is selected from the group consisting of SEQ ID NO: 1598, SEQ ID NO: 
1739, SEQ ID NO: 1775, SEQ ID NO: 1814, SEQ ID NO: 1448, SEQ ID NO: 1466, SEQ 
ID NO: 1535, SEQ ID NO: 1545, SEQ ID NO: 1550, SEQ ID NO: 1680, SEQ ID NO: 
1701, SEQ ID NO: 1719, SEQ ID NO: 1744, SEQ ID NO: 1790, SEQ ID NO: 1859, SEQ
ID NO: 1880, SEQ ID NO: 1885, SEQ ID NO: 1679, SEQ ID NO: 1482, SEQ ID NO:
1485, SEQ ID NO: 1459, SEQ ID NO: 1512, SEQ ID NO: 1515, SEQ ID NO: 1642, SEQ
ID NO: 1668, SEQ ID NO: 1816, SEQ ID NO: 1845, SEQ ID NO: 1865, SEQ ID NO:
1866, SEQ ID NO: 1886, SEQ ID NO: 1509, SEQ ID NO: 1510, SEQ ID NO: 1531, SEQ
ID NO: 1579, SEQ ID NO: 1584, SEQ ID NO: 1662, SEQ ID NO: 1703, SEQ ID NO:
1704, SEQ ID NO: 1737, SEQ ID NO: 1740, SEQ ID NO: 1742, SEQ ID NO: 1754, SEQ
ID NO: 1847, SEQ ID NO: 1447, SEQ ID NO: 1546, SEQ ID NO: 1607, SEQ ID NO:
1609, SEQ ID NO: 1610, SEQ ID NO: 1728, SEQ ID NO: 1489, SEQ ID NO: 1708, SEQ
ID NO: 1808, SEQ ID NO: 1887, SEQ ID NO: 1498, SEQ ID NO: 1506, SEQ ID NO:
1592, SEQ ID NO: 1678, SEQ ID NO: 1778, SEQ ID NO: 1863, SEQ ID NO: 1454, SEQ
ID NO: 1538, SEQ ID NO: 1567, SEQ ID NO: 1581, SEQ ID NO: 1583, SEQ ID NO:
1636, SEQ ID NO: 1639, SEQ ID NO: 1649, SEQ ID NO: 1669, SEQ ID NO: 1695, SEQ
ID NO: 1757, SEQ ID NO: 1776, SEQ ID NO: 1848, SEQ ID NO: 1849, SEQ ID NO:
1858, SEQ ID NO: 1884, SEQ ID NO: 1667, SEQ ID NO: 1690, SEQ ID NO: 1813, SEQ
ID NO: 1468, SEQ ID NO: 1470, SEQ ID NO: 1811, SEQ ID NO: 1874, SEQ ID NO:
1876, SEQ ID NO: 1825, SEQ ID NO: 1479, SEQ ID NO: 1488, SEQ ID NO: 1528, SEQ
ID NO: 1566, SEQ ID NO: 1683, SEQ ID NO: 1692, SEQ ID NO: 1718, SEQ ID NO:
1614, SEQ ID NO: 1519, SEQ ID NO: 1476, SEQ ID NO: 1493, SEQ ID NO: 1497, SEQ
ID NO: 1507, SEQ ID NO: 1490, SEQ ID NO: 1523, SEQ ID NO: 1524, SEQ ID NO:
1543, SEQ ID NO: 1551, SEQ ID NO: 1553, SEQ ID NO: 1554, SEQ ID NO: 1555, SEQ
ID NO: 1562, SEQ ID NO: 1570, SEQ ID NO: 1587, SEQ ID NO: 1588, SEQ ID NO:
1591, SEQ ID NO: 1593, SEQ ID NO: 1684, SEQ ID NO: 1689, SEQ ID NO: 1694, SEQ
ID NO: 1696, SEQ ID NO: 1698, SEQ ID NO: 1700, SEQ ID NO: 1712, SEQ ID NO:
1720, SEQ ID NO: 1730, SEQ ID NO: 1735, SEQ ID NO: 1741, SEQ ID NO: 1748, SEQ
ID NO: 1779, SEQ ID NO: 1821, SEQ ID NO: 1823, SEQ ID NO: 1828, SEQ ID NO:
1834, SEQ ID NO: 1835, SEQ ID NO: 1836, SEQ ID NO: 1839, SEQ ID NO: 1852, SEQ
ID NO: 1853, SEQ ID NO: 1869, SEQ ID NO: 1871, SEQ ID NO: 1878, SEQ ID NO:
1521, SEQ ID NO: 1602, SEQ ID NO: 1627, SEQ ID NO: 1450, SEQ ID NO: 1457, SEQ
ID NO: 1463, SEQ ID NO: 1469, SEQ ID NO: 1481, SEQ ID NO: 1484, SEQ ID NO:
1492, SEQ ID NO: 1500, SEQ ID NO: 1505, SEQ ID NO: 1508, SEQ ID NO: 1541, SEQ
ID NO: 1548, SEQ ID NO: 1580, SEQ ID NO: 1590, SEQ ID NO: 1594, SEQ ID NO:
1603, SEQ ID NO: 1604, SEQ ID NO: 1606, SEQ ID NO: 1612, SEQ ID NO: 1613, SEQ
ID NO: 1620, SEQ ID NO: 1621, SEQ ID NO: 1622, SEQ ID NO: 1631, SEQ ID NO:
1645, SEQ ID NO: 1646, SEQ ID NO: 1650, SEQ ID NO: 1651, SEQ ID NO: 1652, SEQ
ID NO: 1653, SEQ ID NO: 1656, SEQ ID NO: 1763, SEQ ID NO: 1787, SEQ ID NO:
1800, SEQ ID NO: 1806, SEQ ID NO: 1810, SEQ ID NO: 1864, SEQ ID NO: 1877, SEQ
ID NO: 1881, SEQ ID NO: 390, SEQ ID NO: 876, SEQ ID NO: 547, SEQ ID NO: 678,
SEQ ID NO: 729, SEQ ID NO: 786, SEQ ID NO: 654, SEQ ID NO: 734, SEQ ID NO:
646, SEQ ID NO: 522, SEQ ID NO: 696, SEQ ID NO: 807, SEQ ID NO: 683, SEQ ID
NO: 790, SEQ ID NO: 763, SEQ ID NO: 806, SEQ ID NO: 799, SEQ ID NO: 434, SEQ
ID NO: 743, SEQ ID NO: 804, SEQ ID NO: 733, SEQ ID NO: 826, SEQ ID NO: 562,
SEQ ID NO: 420, SEQ ID NO: 664, SEQ ID NO: 850, SEQ ID NO: 857, SEQ ID NO:
861, SEQ ID NO: 872, SEQ ID NO: 544, SEQ ID NO: 830, SEQ ID NO: 446, SEQ ID
NO: 397, SEQ ID NO: 699, SEQ ID NO: 459, SEQ ID NO: 509, SEQ ID NO: 818, SEQ
ID NO: 488, SEQ ID NO: 438, SEQ ID NO: 831, SEQ ID NO: 667, SEQ ID NO: 429,
SEQ ID NO: 680, SEQ ID NO: 597, SEQ ID NO: 460, SEQ ID NO: 709, SEQ ID NO:
822, SEQ ID NO: 466, SEQ ID NO: 584, SEQ ID NO: 388, SEQ ID NO: 631, SEQ ID
NO: 787, SEQ ID NO: 532, SEQ ID NO: 619, SEQ ID NO: 723, SEQ ID NO: 641, SEQ
ID NO: 698, SEQ ID NO: 630, SEQ ID NO: 869, SEQ ID NO: 601, SEQ ID NO: 415,
SEQ ID NO: 542, SEQ ID NO: 704, SEQ ID NO: 572, SEQ ID NO: 467, SEQ ID NO:
399, SEQ ID NO: 579, SEQ ID NO: 739, SEQ ID NO: 849, SEQ ID NO: 824, SEQ ID
NO: 871, SEQ ID NO: 547, SEQ ID NO: 633, SEQ ID NO: 695, SEQ ID NO: 405, SEQ
ID NO: 394, SEQ ID NO: 761, SEQ ID NO: 574, SEQ ID NO: 596, SEQ ID NO: 832,
SEQ ID NO: 651, SEQ ID NO: 867, SEQ ID NO: 614, SEQ ID NO: 401, SEQ ID NO:
393, SEQ ID NO: 413, SEQ ID NO: 835, SEQ ID NO: 863, SEQ ID NO: 458, SEQ ID
NO: 701, SEQ ID NO: 531, SEQ ID NO: 550, SEQ ID NO: 439, SEQ ID NO: 516, SEQ
ID NO: 802, SEQ ID NO: 581, SEQ ID NO: 535, SEQ ID NO: 578, SEQ ID NO: 492,
SEQ ID NO: 858, SEQ ID NO: 720, SEQ ID NO: 813, SEQ ID NO: 426, SEQ ID NO:
834, SEQ ID NO: 609, SEQ ID NO: 489, SEQ ID NO: 480, SEQ ID NO: 406, SEQ ID
NO: 392, SEQ ID NO: 456, SEQ ID NO: 707, SEQ ID NO: 533, SEQ ID NO: 728, SEQ
ID NO: 769, SEQ ID NO: 671, SEQ ID NO: 602, SEQ ID NO: 618, SEQ ID NO: 618,
SEQ ID NO: 682, SEQ ID NO: 524, SEQ ID NO: 802, SEQ ID NO: 785, SEQ ID NO:
457, SEQ ID NO: 781, SEQ ID NO: 473, SEQ ID NO: 384, SEQ ID NO: 726, SEQ ID
NO: 817, SEQ ID NO: 498, SEQ ID NO: 436, SEQ ID NO: 815, SEQ ID NO: 856, SEQ
ID NO: 650, SEQ ID NO: 844, SEQ ID NO: 580, SEQ ID NO: 783, SEQ ID NO: 416,
SEQ ID NO: 741, SEQ ID NO: 442, SEQ ID NO: 803, SEQ ID NO: 520, SEQ ID NO:
566, SEQ ID NO: 557, SEQ ID NO: 706, SEQ ID NO: 710, SEQ ID NO: 487, SEQ ID
NO: 603, SEQ ID NO: 472, SEQ ID NO: 476, SEQ ID NO: 770, SEQ ID NO: 841, SEQ
ID NO: 768, SEQ ID NO: 839, SEQ ID NO: 560, SEQ ID NO: 796, SEQ ID NO: 483,
SEQ ID NO: 634, SEQ ID NO: 445, SEQ ID NO: 853, SEQ ID NO: 525, SEQ ID NO:
798, SEQ ID NO: 549, SEQ ID NO: 836, SEQ ID NO: 589, SEQ ID NO: 760, SEQ ID
NO: 462, SEQ ID NO: 789, SEQ ID NO: 507, SEQ ID NO: 828, SEQ ID NO: 866, SEQ
ID NO: 754, SEQ ID NO: 730, SEQ ID NO: 617, SEQ ID NO: 455, SEQ ID NO: 873,
SEQ ID NO: 435, SEQ ID NO: 766, SEQ ID NO: 793, SEQ ID NO: 742, SEQ ID NO:
599, SEQ ID NO: 854, and SEQ ID NO: 632.

62. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in energy conversion selected from the group consisting of SEQ ID NO: 1598,
SEQ ID NO: 1739, SEQ ID NO: 1775, SEQ ID NO: 1814, SEQ ID NO: 390, SEQ ID NO:
876, SEQ ID NO: 547, and SEQ ID NO: 678.

63. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in amino acid metabolism selected from the group consisting of SEQ ID NO:
1448, SEQ ID NO: 1466, SEQ ID NO: 1535, SEQ ID NO: 1545, SEQ ID NO: 1550, SEQ
ID NO: 1680, SEQ ID NO: 1701, SEQ ID NO: 1719, SEQ ID NO: 1744, SEQ ID NO:
1790, SEQ ID NO: 1859, SEQ ID NO: 1880, SEQ ID NO: 1885, SEQ ID NO: 1679, SEQ
ID NO: 1482, SEQ ID NO: 1485, SEQ ID NO: 1459, SEQ ID NO: 729, SEQ ID NO: 786,
SEQ ID NO: 654, SEQ ID NO: 734, SEQ ID NO: 646, SEQ ID NO: 522, SEQ ID NO:
696, SEQ ID NO: 807, SEQ ID NO: 683, SEQ ID NO: 790, SEQ ID NO: 763, SEQ ID
NO: 806, SEQ ID NO: 799, SEQ ID NO: 434, SEQ ID NO: 743, SEQ ID NO: 804, and SEQ
ID NO: 733.

64. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in nucleotide metabolism selected from the group consisting of SEQ ID NO:
1512, SEQ ID NO: 1515, SEQ ID NO: 1642, SEQ ID NO: 1668, SEQ ID NO: 1816, SEQ
ID NO: 1845, SEQ ID NO: 1865, SEQ ID NO: 1866, SEQ ID NO: 1886, SEQ ID NO:
1509, SEQ ID NO: 1510, SEQ ID NO: 826, SEQ ID NO: 562, SEQ ID NO: 420, SEQ ID
NO: 664, SEQ ID NO: 850, SEQ ID NO: 857, SEQ ID NO: 861, SEQ ID NO: 872, SEQ
ID NO: 544, SEQ ID NO: 830, and SEQ ID NO: 446.

65. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in carbohydrate metabolism selected from the group consisting of SEQ ID NO:
1531, SEQ ID NO: 1579, SEQ ID NO: 1584, SEQ ID NO: 1662, SEQ ID NO: 1703, SEQ
ID NO: 1704, SEQ ID NO: 1737, SEQ ID NO: 1740, SEQ ID NO: 1742, SEQ ID NO:
1754, SEQ ID NO: 1847, SEQ ID NO: 1447, SEQ ID NO: 397, SEQ ID NO: 699, SEQ ID
NO: 459, SEQ ID NO: 509, SEQ ID NO: 818, SEQ ID NO: 488, SEQ ID NO: 438, SEQ
ID NO: 831, SEQ ID NO: 667, SEQ ID NO: 429, SEQ ID NO: 680, and SEQ ID NO: 597.

66. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in cofactor metabolism selected from the group consisting of SEQ ID NO: 1546,
SEQ ID NO: 1607, SEQ ID NO: 1609, SEQ ID NO: 1610, SEQ ID NO: 1728, SEQ ID
NO: 1489, SEQ ID NO: 460, SEQ ID NO: 709, SEQ ID NO: 822, SEQ ID NO: 466, SEQ
ID NO: 584, and SEQ ID NO: 388.

67. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in lipid metabolism selected from the group consisting of SEQ ID NO: 1708, SEQ
ID NO: 1808, SEQ ID NO: 1887, SEQ ID NO: 631, SEQ ID NO: 787, and SEQ ID NO:
532.

68. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in mRNA translation and ribosome biogenesis selected from the group consisting
of SEQ ID NO: 1498, SEQ ID NO: 1506, SEQ ID NO: 1592, SEQ ID NO: 1678, SEQ ID
NO: 1778, SEQ ID NO: 1863, SEQ ID NO: 619, SEQ ID NO: 723, SEQ ID NO: 641, SEQ 
ID NO: 698, SEQ ID NO: 630, and SEQ ID NO: 869. 

69. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in genome replication, transcription, recombination and repair selected from the
group consisting of SEQ ID NO: 1454, SEQ ID NO: 1538, SEQ ID NO: 1567, SEQ ID
NO: 1581, SEQ ID NO: 1583, SEQ ID NO: 1636, SEQ ID NO: 1639, SEQ ID NO: 1649,
SEQ ID NO: 1669, SEQ ID NO: 1695, SEQ ID NO: 1757, SEQ ID NO: 1776, SEQ ID
NO: 1848, SEQ ID NO: 1849, SEQ ID NO: 1858, SEQ ID NO: 1884, SEQ ID NO: 601,
SEQ ID NO: 415, SEQ ID NO: 542, SEQ ID NO: 704, SEQ ID NO: 572, SEQ ID NO:
467, SEQ ID NO: 399, SEQ ID NO: 579, SEQ ID NO: 739, SEQ ID NO: 849, SEQ ID
NO: 824, SEQ ID NO: 871, SEQ ID NO: 547, SEQ ID NO: 633, SEQ ID NO: 695, SEQ
ID NO: 405, SEQ ID NO: 394, and SEQ ID NO: 761.

70. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof
involved in outer membrane or cell wall biosynthesis selected from the group consisting of
SEQ ID NO: 1667, SEQ ID NO: 1690, SEQ ID NO: 1813, SEQ ID NO: 1468, SEQ ID
NO: 1470, SEQ ID NO: 1811, SEQ ID NO: 574, SEQ ID NO: 596, SEQ ID NO: 832, SEQ
ID NO: 651, SEQ ID NO: 867, SEQ ID NO: 614, SEQ ID NO: 401, and SEQ ID NO: 393.

71. The purified polypeptide of claim 61, wherein said H. pylori cytoplasmic
polypeptide or a fragment thereof is an H. pylori chaperone polypeptide or a fragment
thereof selected from the group consisting of SEQ ID NO: 1874, SEQ ID NO: 1876, SEQ
ID NO: 1825, SEQ ID NO: 413, SEQ ID NO: 835, and SEQ ID NO: 863.

72. A purified H. pylori secreted or periplasmic polypeptide or a fragment
thereof, wherein said polypeptide is selected from the group consisting of SEQ ID NO:
1455, SEQ ID NO: 1589, SEQ ID NO: 1518, SEQ ID NO: 1529, SEQ ID NO: 1765, SEQ
ID NO: 1770, SEQ ID NO: 1829, SEQ ID NO: 1556, SEQ ID NO: 1565, SEQ ID NO:
1569, SEQ ID NO: 1571, SEQ ID NO: 1574, SEQ ID NO: 1578, SEQ ID NO: 1663, SEQ
ID NO: 1674, SEQ ID NO: 1676, SEQ ID NO: 1697, SEQ ID NO: 1699, SEQ ID NO:
1710, SEQ ID NO: 1715, SEQ ID NO: 1716, SEQ ID NO: 1732, SEQ ID NO: 1736, SEQ
ID NO: 1745, SEQ ID NO: 1749, SEQ ID NO: 1750, SEQ ID NO: 1766, SEQ ID NO:
1767, SEQ ID NO: 1768, SEQ ID NO: 1769, SEQ ID NO: 1795, SEQ ID NO: 1802, SEQ
ID NO: 1804, SEQ ID NO: 1824, SEQ ID NO: 1831, SEQ ID NO: 1838, SEQ ID NO:
1840, SEQ ID NO: 1844, SEQ ID NO: 1862, SEQ ID NO: 1879, SEQ ID NO: 1882, SEQ
ID NO: 1890, SEQ ID NO: 1494, SEQ ID NO: 1634, SEQ ID NO: 1635, SEQ ID NO:
1647, SEQ ID NO: 1648, SEQ ID NO: 1654, SEQ ID NO: 1446, SEQ ID NO: 1449, SEQ
ID NO: 1452, SEQ ID NO: 1473, SEQ ID NO: 1474, SEQ ID NO: 1480, SEQ ID NO:
1491, SEQ ID NO: 1502, SEQ ID NO: 1513, SEQ ID NO: 1605, SEQ ID NO: 1771, SEQ
ID NO: 1526, SEQ ID NO: 1557, SEQ ID NO: 1560, SEQ ID NO: 1585, SEQ ID NO:
1672, SEQ ID NO: 1677, SEQ ID NO: 1686, SEQ ID NO: 1752, SEQ ID NO: 1762, SEQ
ID NO: 1777, SEQ ID NO: 1792, SEQ ID NO: 1805, SEQ ID NO: 1815, SEQ ID NO:
1817, SEQ ID NO: 1827, SEQ ID NO: 1842, SEQ ID NO: 1846, SEQ ID NO: 1896, SEQ
ID NO: 1530, SEQ ID NO: 1637, SEQ ID NO: 1461, SEQ ID NO: 1467, SEQ ID NO:
1623, SEQ ID NO: 1625, SEQ ID NO: 530, SEQ ID NO: 708, SEQ ID NO: 414, SEQ ID
NO: 694, SEQ ID NO: 703, SEQ ID NO: 721, SEQ ID NO: 749, SEQ ID NO: 685, SEQ
ID NO: 444, SEQ ID NO: 606, SEQ ID NO: 582, SEQ ID NO: 621, SEQ ID NO: 868,
SEQ ID NO: 666, SEQ ID NO: 408, SEQ ID NO: 538, SEQ ID NO: 573, SEQ ID NO:
639, SEQ ID NO: 668, SEQ ID NO: 524, SEQ ID NO: 422, SEQ ID NO: 819, SEQ ID
NO: 611, SEQ ID NO: 674, SEQ ID NO: 577, SEQ ID NO: 663, SEQ ID NO: 558, SEQ
ID NO: 794, SEQ ID NO: 564, SEQ ID NO: 592, SEQ ID NO: 814, SEQ ID NO: 398,
SEQ ID NO: 767, SEQ ID NO: 425, SEQ ID NO: 659, SEQ ID NO: 517, SEQ ID NO:
539, SEQ ID NO: 475, SEQ ID NO: 615, SEQ ID NO: 665, SEQ ID NO: 607, SEQ ID
NO: 598, SEQ ID NO: 759, SEQ ID NO: 752, SEQ ID NO: 595, SEQ ID NO: 686, SEQ
ID NO: 528, SEQ ID NO: 705, SEQ ID NO: 828, SEQ ID NO: 403, SEQ ID NO: 561,
SEQ ID NO: 500, SEQ ID NO: 491, SEQ ID NO: 846, SEQ ID NO: 732, SEQ ID NO:
778, SEQ ID NO: 751, SEQ ID NO: 744, SEQ ID NO: 504, SEQ ID NO: 419, SEQ ID
NO: 792, SEQ ID NO: 825, SEQ ID NO: 756, SEQ ID NO: 519, SEQ ID NO: 870, SEQ
ID NO: 777, SEQ ID NO: 808, SEQ ID NO: 506, SEQ ID NO: 864, SEQ ID NO: 655,
SEQ ID NO: 407, SEQ ID NO: 427, SEQ ID NO: 774, SEQ ID NO: 797, SEQ ID NO:
688, SEQ ID NO: 815, SEQ ID NO: 718, SEQ ID NO: 859, SEQ ID NO: 775, SEQ ID
NO: 874, SEQ ID NO: 543, SEQ ID NO: 878, SEQ ID NO: 594, SEQ ID NO: 610, and
SEQ ID NO: 600.

73. A purified H. pylori surface or membrane polypeptide or a fragment thereof,
wherein said polypeptide is selected from the group consisting of SEQ ID NO: 1511, SEQ
ID NO: 1561, SEQ ID NO: 1563, SEQ ID NO: 1681, SEQ ID NO: 1711, SEQ ID NO:
1731, SEQ ID NO: 1743, SEQ ID NO: 1747, SEQ ID NO: 1758, SEQ ID NO: 1893, SEQ
ID NO: 1895, SEQ ID NO: 1573, SEQ ID NO: 1705, SEQ ID NO: 1707, SEQ ID NO:
1723, SEQ ID NO: 1726, SEQ ID NO: 1760, SEQ ID NO: 1764, SEQ ID NO: 1798, SEQ
ID NO: 1803, SEQ ID NO: 1807, SEQ ID NO: 1889, SEQ ID NO: 1892, SEQ ID NO:
1460, SEQ ID NO: 1477, SEQ ID NO: 1499, SEQ ID NO: 1514, SEQ ID NO: 1641, SEQ
ID NO: 1534, SEQ ID NO: 1564, SEQ ID NO: 1673, SEQ ID NO: 1746, SEQ ID NO:
1794, SEQ ID NO: 1843, SEQ ID NO: 1894, SEQ ID NO: 1536, SEQ ID NO: 1544, SEQ
ID NO: 1568, SEQ ID NO: 1572, SEQ ID NO: 1582, SEQ ID NO: 1738, SEQ ID NO:
1891, SEQ ID NO: 1660, SEQ ID NO: 1793, SEQ ID NO: 1832, SEQ ID NO: 1841, SEQ
ID NO: 1860, SEQ ID NO: 1486, SEQ ID NO: 1465, SEQ ID NO: 1539, SEQ ID NO:
1693, SEQ ID NO: 1629, SEQ ID NO: 1540, SEQ ID NO: 1791, SEQ ID NO: 1525, SEQ
ID NO: 1558, SEQ ID NO: 1655, SEQ ID NO: 1517, SEQ ID NO: 875, SEQ ID NO: 676,
SEQ ID NO: 801, SEQ ID NO: 860, SEQ ID NO: 747, SEQ ID NO: 529, SEQ ID NO:
387, SEQ ID NO: 391, SEQ ID NO: 515, SEQ ID NO: 625, SEQ ID NO: 568, SEQ ID
NO: 454, SEQ ID NO: 800, SEQ ID NO: 499, SEQ ID NO: 779, SEQ ID NO: 648, SEQ
ID NO: 643, SEQ ID NO: 537, SEQ ID NO: 511, SEQ ID NO: 616, SEQ ID NO: 829,
SEQ ID NO: 546, SEQ ID NO: 780, SEQ ID NO: 553, SEQ ID NO: 549, SEQ ID NO:
410, SEQ ID NO: 608, SEQ ID NO: 513, SEQ ID NO: 656, SEQ ID NO: 845, SEQ ID
NO: 563, SEQ ID NO: 570, SEQ ID NO: 805, SEQ ID NO: 851, SEQ ID NO: 423, SEQ
ID NO: 810, SEQ ID NO: 604, SEQ ID NO: 636, SEQ ID NO: 757, SEQ ID NO: 412,
SEQ ID NO: 816, SEQ ID NO: 771, SEQ ID NO: 418, SEQ ID NO: 662, SEQ ID NO:
512, SEQ ID NO: 679, SEQ ID NO: 605, SEQ ID NO: 421, SEQ ID NO: 552, SEQ ID
NO: 576, SEQ ID NO: 571, SEQ ID NO: 725, SEQ ID NO: 565, SEQ ID NO: 717, SEQ
ID NO: 536, SEQ ID NO: 647, SEQ ID NO: 501, SEQ ID NO: 838, SEQ ID NO: 428,
SEQ ID NO: 494, SEQ ID NO: 479, SEQ ID NO: 711, SEQ ID NO: 722, SEQ ID NO:
645, SEQ ID NO: 670, SEQ ID NO: 746, SEQ ID NO: 493, SEQ ID NO: 540, SEQ ID
NO: 612, SEQ ID NO: 629, SEQ ID NO: 484, SEQ ID NO: 485, SEQ ID NO: 486, SEQ
ID NO: 433, SEQ ID NO: 385, and SEQ ID NO: 409.

74. The purified polypeptide of claim 73, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least one membrane spanning region selected from the group consisting of
SEQ ID NO: 1511, SEQ ID NO: 1561, SEQ ID NO: 1563, SEQ ID NO: 1681, SEQ ID
NO: 1711, SEQ ID NO: 1731, SEQ ID NO: 1743, SEQ ID NO: 1747, SEQ ID NO: 1758,
SEQ ID NO: 1893, SEQ ID NO: 1895, SEQ ID NO: 875, SEQ ID NO: 676, SEQ ID NO:
801, SEQ ID NO: 860, SEQ ID NO: 747, SEQ ID NO: 529, SEQ ID NO: 387, SEQ ID
NO: 391, SEQ ID NO: 515, SEQ ID NO: 625, SEQ ID NO: 568, SEQ ID NO: 454, SEQ
ID NO: 800, SEQ ID NO: 499, SEQ ID NO: 779, and SEQ ID NO: 385.

75. The purified polypeptide of claim 73, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least two membrane spanning regions selected from the group consisting
of SEQ ID NO: 1573, SEQ ID NO: 1705, SEQ ID NO: 1707, SEQ ID NO: 1723, SEQ ID
NO: 1726, SEQ ID NO: 1760, SEQ ID NO: 1764, SEQ ID NO: 1798, SEQ ID NO: 1803,
SEQ ID NO: 1807, SEQ ID NO: 1889, SEQ ID NO: 1892, SEQ ID NO: 1460, SEQ ID
NO: 1477, SEQ ID NO: 1499, SEQ ID NO: 1514, SEQ ID NO: 1641, SEQ ID NO: 648,
SEQ ID NO: 643, SEQ ID NO: 537, SEQ ID NO: 511, SEQ ID NO: 616, SEQ ID NO:
829, SEQ ID NO: 546, SEQ ID NO: 780, SEQ ID NO: 553, SEQ ID NO: 549, SEQ ID
NO: 410, SEQ ID NO: 608, SEQ ID NO: 513, SEQ ID NO: 656, SEQ ID NO: 845, SEQ
ID NO: 563, SEQ ID NO: 570, SEQ ID NO: 805, and SEQ ID NO: 851.

76. The purified polypeptide of claim 73, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least three membrane spanning regions selected from the group consisting
of SEQ ID NO: 1534, SEQ ID NO: 1564, SEQ ID NO: 1673, SEQ ID NO: 1746, SEQ ID
NO: 1794, SEQ ID NO: 1843, SEQ ID NO: 1894, SEQ ID NO: 423, SEQ ID NO: 810,
SEQ ID NO: 604, SEQ ID NO: 636, SEQ ID NO: 757, SEQ ID NO: 412, and SEQ ID NO:
816.

77. The purified polypeptide of claim 73, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least four membrane spanning regions selected from the group consisting
of SEQ ID NO: 1536, SEQ ID NO: 1544, SEQ ID NO: 1568, SEQ ID NO: 1572, SEQ ID
NO: 1582, SEQ ID NO: 1738, SEQ ID NO: 1891, SEQ ID NO: 1660, SEQ ID NO: 771,
SEQ ID NO: 418, SEQ ID NO: 662, SEQ ID NO: 512, SEQ ID NO: 679, SEQ ID NO:
605, SEQ ID NO: 421, SEQ ID NO: 552, SEQ ID NO: 576, and SEQ ID NO: 571.

78. The purified polypeptide of claim 73, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least five membrane spanning regions selected from the group consisting
of SEQ ID NO: 1793, SEQ ID NO: 1832, SEQ ID NO: 1841, SEQ ID NO: 1860, SEQ ID
NO: 1486, SEQ ID NO: 725, SEQ ID NO: 565, SEQ ID NO: 717, SEQ ID NO: 536, SEQ
ID NO: 647, and SEQ ID NO: 409.

79. The purified polypeptide of claim 73, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least six membrane spanning regions selected from the group consisting
of SEQ ID NO: 1465, SEQ ID NO: 1539, SEQ ID NO: 1693, SEQ ID NO: 1629, SEQ ID
NO: 501, SEQ ID NO: 838, SEQ ID NO: 428, SEQ ID NO: 494, SEQ ID NO: 479, SEQ
ID NO: 711, and SEQ ID NO: 722.

80. The purified polypeptide of claim 73, wherein said H. pylori surface or
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment
thereof having at least seven membrane spanning regions selected from the group
consisting of SEQ ID NO: 1540, SEQ ID NO: 1791, SEQ ID NO: 1525, SEQ ID NO: 1558,
SEQ ID NO: 1655, SEQ ID NO: 1517, SEQ ID NO: 645, SEQ ID NO: 670, SEQ ID NO:
746, SEQ ID NO: 493, SEQ ID NO: 540, SEQ ID NO: 612, SEQ ID NO: 629, SEQ ID
NO: 484, SEQ ID NO: 485, SEQ ID NO: 486, and SEQ ID NO: 433.

81. A vaccine composition for prevention or treatment of an H. pylori infection
comprising an effective amount of an H. pylori polypeptide of claim 56.

82. A vaccine composition of claim 81, further comprising a pharmaceutically
acceptable carrier.

83. A vaccine composition of claim 82, wherein the pharmaceutically acceptable
carrier is an adjuvant.

84. A method of treating a subject for H. pylori infection comprising
administering to a subject a vaccine composition of claim 81, such that treatment of H.
pylori infection occurs.

85. A method of claim 84, wherein the treatment is a prophylactic treatment.

86. A method of claim 84, wherein the treatment is a therapeutic treatment.

87. A vaccine composition for prevention or treatment of an H. pylori infection
comprising an effective amount of an H. pylori polypeptide of claim 61.

88. A vaccine composition of claim 87, further comprising a pharmaceutically
acceptable carrier.

89. A vaccine composition of claim 88, wherein the pharmaceutically acceptable
carrier is an adjuvant.

90. A method of treating a subject for H. pylori infection comprising
administering to a subject a vaccine composition of claim 87, such that treatment of H.
pylori infection occurs.

91. A method of claim 90, wherein the treatment is a prophylactic treatment.

92. A method of claim 90, wherein the treatment is a therapeutic treatment.

93. A vaccine composition for prevention or treatment of an H. pylori infection
comprising an effective amount of an H. pylori polypeptide of claim 72.

94. A vaccine composition of claim 93, further comprising a pharmaceutically
acceptable carrier.

95. A vaccine composition of claim 94, wherein the pharmaceutically acceptable
carrier is an adjuvant.

96. A method of treating a subject for H. pylori infection comprising
administering to a subject a vaccine composition of claim 93, such that treatment of H.
pylori infection occurs.

97. A method of claim 96, wherein the treatment is a prophylactic treatment.

98. A method of claim 96, wherein the treatment is a therapeutic treatment.

99. A vaccine composition for prevention or treatment of an H. pylori infection
comprising an effective amount of an H. pylori polypeptide of claim 73.

100. A vaccine composition of claim 99, further comprising a pharmaceutically
acceptable carrier.

101. A vaccine composition of claim 100, wherein the pharmaceutically
acceptable carrier is an adjuvant.

102. A method of treating a subject for H. pylori infection comprising
administering to a subject a vaccine composition of claim 99, such that treatment of H.
pylori infection occurs.

103. A method of claim 102, wherein the treatment is a prophylactic treatment.

104. A method of claim 102, wherein the treatment is a therapeutic treatment.

105. A method for detecting the presence of a Helicobacter nucleic acid in a
sample comprising:

(a) contacting a sample with a nucleic acid of claim 1 under conditions
in which a hybrid can form between the probe and a Helicobacter nucleic acid in the
sample; and

(b) detecting the hybrid formed in step (a), wherein detection of a hybrid
indicates the presence of a Helicobacter nucleic acid in the sample.

106. A method for detecting the presence of a Helicobacter nucleic acid in a
sample comprising:

(a) contacting a sample with a nucleic acid of claim 9 under conditions
in which a hybrid can form between the probe and a Helicobacter nucleic acid in the
sample; and

(b) detecting the hybrid formed in step (a), wherein detection of a hybrid
indicates the presence of a Helicobacter nucleic acid in the sample.

107. A method for detecting the presence of a Helicobacter nucleic acid in a
sample comprising:

(a) contacting a sample with a nucleic acid of claim 23 under conditions
in which a hybrid can form between the probe and a Helicobacter nucleic acid in the
sample; and

(b) detecting the hybrid formed in step (a), wherein detection of a hybrid
indicates the presence of a Helicobacter nucleic acid in the sample.

108. A method for detecting the presence of a Helicobacter nucleic acid in a
sample comprising:

(a) contacting a sample with a nucleic acid of claim 27 under conditions
in which a hybrid can form between the probe and a Helicobacter nucleic acid in the
sample; and

(b) detecting the hybrid formed in step (a), wherein detection of a hybrid
indicates the presence of a Helicobacter nucleic acid in the sample.

Asp Ile Glu Val Gly Ile Val Ile Gly Gly Gly Asn Ile Ile Arg Gly
20 25 30

Val Ser Ala Ala Leu Gly Gly Ile Ile Arg Arg Thr Ser Gly Asp Tyr
35 40 45

Met Gly Met Leu Ala Thr Val Ile Lys Arg
50 55



(2) INFORMATION FOR SEQ ID NO: 447:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...85

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447

Val His Asn Phe His Trp Asn Val Lys Gly Thr Asp Phe Phe Asn Val
1 5 10 15

His Lys Ala Thr Glu Glu Ile Tyr Glu Gly Phe Ala Asp Met Phe Asp
20 25 30

Asp Leu Ala Glu Arg Ile Val Gln Leu Gly His His Pro Leu Val Thr
35 40 45

Leu Ser Glu Ala Ile Lys Leu Thr Arg Val Lys Glu Glu Thr Lys Thr
50 55 60

Ser Phe His Ser Lys Asp Ile Phe Lys Glu Ile Leu Glu Asp Tyr Lys
65 70 75 80

His Leu Glu Lys Glu
85



(2) INFORMATION FOR SEQ ID NO: 448:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...90

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448

Met Asn Lys Thr Ile Lys Ala Ala Ala Leu Ala Tyr Asn Met Gly Gln
1 5 10 15

Asp His Ala Pro Lys Val Ile Ala Ser Gly Val Gly Glu Val Ala Lys



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464

Val Lys Lys Tyr Ala Glu Asp Phe Ile Thr Lys Asp Glu Val Lys Ser
1 5 10 15

Leu Leu Glu Arg Leu Ala Lys Asp Tyr Pro Thr Ile Val Glu Glu Ser
20 25 30

Lys Lys Ile Pro Thr Gly Ala Ile Arg Ser Val Leu Gln Ala Leu Leu
35 40 45

His Glu Lys Ile Pro Ile Lys Asp Met Leu Thr Ile Leu Glu Thr Ile
50 55 60

Thr Asp Ile Ala Pro Leu Val Gln Asn Asp Val Asn Ile Leu Thr Glu
65 70 75 80

Gln Val Arg Ala Arg Leu Ser Arg Val Ile Thr Asn Ala Phe Lys Ser
85 90 95

Glu Asp Gly Arg Leu Lys Phe Leu Thr Phe Ser Thr Asp Unk Glu Gln
100 105 110

Phe Unk Ala Gln
115



(2) INFORMATION FOR SEQ ID NO: 465:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 153 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...153

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465

Val Gly Ala Asn Pro Val Pro His Ala Gln Ile Leu Gln Ser Val Val
1 5 10 15

Asp Asp Leu Lys Glu Lys Gly Ile Lys Leu Val Ile Val Ser Phe Thr
20 25 30

Asp Tyr Val Leu Pro Asn Leu Ala Leu Asn Asp Gly Ser Leu Asp Ala
35 40 45

Asn Tyr Phe Gln His Arg Pro Tyr Leu Asp Arg Phe Asn Leu Asp Arg
50 55 60

Lys Met His Leu Val Gly Leu Ala Asn Ile His Val Glu Pro Leu Arg
65 70 75 80

Phe Tyr Ser Gln Lys Ile Thr Asp Ile Lys Asn Leu Lys Lys Gly Ser
85 90 95

Val Ile Ala Val Pro Asn Asp Pro Ala Asn Gln Gly Arg Ala Leu Ile
100 105 110

Leu Leu His Lys Gln Gly Leu Ile Ala Leu Lys Asp Pro Ser Asn Leu
115 120 125

Tyr Ala Thr Glu Phe Asp Ile Val Lys Asn Pro Tyr Asn Ile Lys Ile
130 135 140

Lys Pro Leu Glu Ala Ala Val Ile Ala
145 150

(2) INFORMATION FOR SEQ ID NO: 466:

85 90 95

Tyr Leu Gln Glu Lys Unk Unk Ile Phe Pro Thr Unk Lys Unk Unk Unk
100 105 110

Ser Lys Ile Met Arg Leu Asp Gly Val Lys His Val Glu Lys Asn Leu
115 120 125

Lys Asp Asn Leu Glu Glu Met Ser Asp Glu Val Lys Asn Lys Gly Ser
130 135 140

Ile Asp Asn Ala Lys Glu Ser Phe Asn Lys Gly Tyr Gly
145 150 155



(2) INFORMATION FOR SEQ ID NO: 649:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 312 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...312

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649

Met Lys Ser Asp Lys Pro Phe Leu Glu Arg Tyr Phe Tyr Asp Pro Thr
1 5 10 15

Leu Leu Gln Lys Gly Leu Ile Phe Ala Leu Tyr Pro Phe Ser Leu Ile
20 25 30

Tyr Gln Cys Ile Ala Thr Ile Lys Arg Lys Thr Ala Lys Lys His Asp
35 40 45

Phe Lys Ile Pro Ile Ile Ser Ile Gly Asn Leu Ile Ala Gly Gly Ser
50 55 60

Gly Lys Thr Pro Phe Ile Leu Glu Ile Ala Pro Arg Tyr Gln Glu Val
65 70 75 80

Ala Val Val Ser Arg Gly Tyr Gln Arg Asp Ser Lys Gly Leu Val Val
85 90 95

Val Ser Val Lys Gly Asn Ile Leu Val Pro Gln Lys Thr Ala Gly Asp
100 105 110

Glu Ala Tyr Leu Leu Ala Leu Asn Leu Lys Gln Ala Ser Val Ile Val
115 120 125

Ser Glu Lys Arg Glu Leu Gly Val Leu Lys Ala Leu Glu Leu Gly Ser
130 135 140

Lys Ile Val Phe Leu Asp Asp Gly Phe Arg Phe Asn Phe Asn Gln Phe
145 150 155 160

Asn Ala Leu Leu Lys Pro Lys Val Pro Pro Tyr Tyr Pro Phe Cys Leu
165 170 175

Pro Ser Gly Leu Tyr Arg Glu Asn Ile Lys Ser Tyr Lys Glu Ala His
180 185 190

Leu Val Ile Thr Glu Asp Lys Asp Tyr Gln Arg Ile Thr Ser Ile Thr
195 200 205

Asn Pro Thr Lys Arg Met Leu Leu Val Thr Ala Ile Ala Asn Pro Ser
210 215 220

Arg Leu Asp Ala Phe Leu Pro Lys Glu Val Val Lys Lys Leu Tyr Phe
225 230 235 240

Arg Asp His Ala Pro Phe Asp Leu Lys Leu Leu Glu Lys Glu Phe Tyr
245 250 255

Gln Asn Asn Ala Thr Ser Leu Leu Val Thr Ser Lys Asp Leu Val Lys
260 265 270

Leu Gln Asp Cys Lys Leu Pro Leu Ser Val Leu Asp Leu Lys Leu Glu
275 280 285

Ile Cys Pro Lys Val Leu Glu Glu Ile Asp Arg Tyr Ile Leu Ser Tyr
290 295 300

Pro Cys Asn Ile Lys Glu His Leu
305 310



(2) INFORMATION FOR SEQ ID NO: 650:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 56 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...56

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650

Met Ser Leu Gly Ala Val Ile Arg Leu Ile Phe Cys Tyr Lys Leu Glu
1 5 10 15

Gly Val Ile Leu Asp Leu Lys Arg Ile Asn Phe Lys Ser Tyr Tyr Pro
20 25 30

Asn Asn Lys Asn Ala Leu Phe Ile Asn Asn Lys Lys Asn Pro Leu Ser
35 40 45

Ser Thr Ser Lys Phe Ile Leu Leu
50 55


(2) INFORMATION FOR SEQ ID NO: 651:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 199 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...199

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651

Met Leu Glu Thr Thr Ile Asp Phe Ser Arg Tyr Ser Ser Val Lys Ile
1 5 10 15

Gly Ala Pro Leu Lys Val Ser Val Leu Glu Asn Asp Asn Glu Ile Ser
20 25 30

Gln Glu His Gln Ile Ile Gly Leu Ala Asn Asn Leu Leu Ile Ala Pro
35 40 45

Asp Val Lys Asn Leu Ala Leu Leu Gly Lys Asn Tyr Asp Tyr Ile Cys

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...100

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657

Met Gly Ala Ile Ala Ser Cys Tyr Ala His Gln Ile Ile Leu Thr Ser
1 5 10 15

Asp Asn Pro Arg Ser Glu Asn Glu Glu Asp Ile Ile Lys Asp Ile Leu
20 25 30

Lys Gly Ile Asn Asn Ser Ser Lys Val Ile Val Glu Lys Asp Arg Lys
35 40 45

Lys Ala Ile Leu Asn Ala Leu Glu Asn Leu Lys Asp Asp Glu Val Leu
50 55 60

Leu Ile Leu Gly Lys Gly Asp Glu Asn Ile Gln Ile Phe Lys Asp Lys
65 70 75 80

Thr Ile Phe Phe Ser Asp Gln Glu Val Val Lys Asp Tyr Tyr Leu Asn
85 90 95

Leu Lys Gln Gly
100



(2) INFORMATION FOR SEQ ID NO: 658:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...80

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658

Val Met Asp Lys Leu Thr Lys Ser Leu Gln Thr Gln Lys Asn Phe Ala
1 5 10 15

Tyr Leu Gly Lys Ile Lys Pro Gln Gln Leu Ala Asp Phe Ile Ile Asn
20 25 30

Glu His Pro Gln Thr Ile Ala Leu Ile Leu Ala His Met Glu Unk Pro
35 40 45

Asn Ala Ala Glu Thr Leu Ser Tyr Phe Pro Asp Glu Met Lys Ala Glu
50 55 60

Ile Ser Ile Arg Met Ala Asn Phe Arg Arg Asn Ile Ala Pro Ser Gly
65 70 75 80



(2) INFORMATION FOR SEQ ID NO: 659:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 205 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

50 55 60

Leu Asn Glu Ala Leu Gly Lys Lys Phe Asn Leu Pro Met Asp Asn Ile
65 70 75 80

Lys Asn Phe Met Ala Glu Val Leu Lys Asn Gly Phe Asp Ser Ile Lys
85 90 95

Asn Met Gly Ser Ala Leu Val Gly Asn Gly Phe Gly Ser Ser Lys Ser
100 105 110

Asp Lys Thr Thr Asn Lys Met Ser Val Pro Gln Val Arg Leu
115 120 125



(2) INFORMATION FOR SEQ ID NO: 677:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 195 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...195

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677

Val Gly Ala Met Pro Thr Ile Gln Ile Arg Unk Phe Gly Ala Gly Gly
1 5 10 15

Ser Gly His Ser Asp Ala Thr Leu Met Leu Val Asn Gly Ile Pro Val
20 25 30

Tyr Met Ala Pro Tyr Ala His Ile Glu Leu Asp Ile Phe Pro Val Thr
35 40 45

Phe Gln Ala Ile Asp Arg Ile Asp Val Ile Lys Gly Gly Gly Ser Val
50 55 60

Gln O^/r Gly Pro Asn Thr Tyr Gly Gly Ile Val Asn Ile Ile Thr Lys
65 70 75 80

Pro Ile Pro Asn Gln Trp Glu Asn Gln Ala Ala Glu Arg Unk Thr Tyr
85 90 95

Trp Ala Lys Ala Arg Asn Ala Gly Phe Ala Ala Pro Unk Asp Lys Thr
100 105 110

Gly Asp Pro Ser Phe Ile Lys Ser Leu Gly Asn Asn Leu Leu Tyr Asn
115 120 125

Thr Tyr Val Arg Ser Gly Gly Met Ile Asn Lys His Val Gly Ile Gln
130 135 140

Arg Lys Leu Thr Gly Leu Glu Ala Lys Ala Leu Gly Thr Ile Ala Pro
145 150 155 160

Leu Val Phe Gln Thr Ile Gly Trp Met Gly Ser Met Thr Ser Met Lys
165 170 175

Ala Met Gly Leu Lys Pro Ile Thr Asn Thr Thr Ile Leu Ala Ile Unk
180 185 190

Gln Pro Gly
195






(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...84

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684

Met Arg Unk Glu Lys Ile Met Thr Asn Phe Glu Lys Unk Ile Ala Gln
1 5 10 15

Asn Arg Leu Lys Thr Asn Ala Val Leu Thr Thr Tyr Cys Ala Ile Phe
20 25 30

Ala Phe Ile Gly Leu Leu Val Asp Ala Ile Arg Ile Asn Ala Asn Asp
35 40 45

Leu Gly Ile Ala Leu Phe Lys Leu Met Thr Phe Gln Ile Phe Pro Thr
50 55 60

Unk Thr Ile Val Met Phe Val Val Ala Phe Val Ile Unk Unk Ser Leu
65 70 75 80

Tyr Pro Lys Phe



(2) INFORMATION FOR SEQ ID NO: 685:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 76 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...76

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685

Met Unk Met Ser His Ile Ile Lys Ser Ile Glu Ala Leu Asp Asp Tyr
1 5 10 15

Thr Ile Arg Phe Thr Leu Asn Gly Pro Glu Ala Pro Phe Leu Ala Asn
20 25 30

Leu Gly Met Asp Phe Leu Ser Ile Leu Ser Lys Asp Tyr Ala Asp Tyr
35 40 45

Leu Ala Gln Asn Asn Lys Lys Asp Glu Leu Ala Lys Unk Pro Val Gly
50 55 60

Thr Gly Pro Phe Lys Phe Phe Leu Trp Asn Lys Arg
65 70 75



(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...253

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785

Met Ala Tyr Lys Tyr Asp Arg Asp Leu Glu Phe Leu Lys Gln Leu Glu
1 5 10 15

Ser Ser Asp Leu Leu Asp Leu Phe Glu Val Leu Val Phe Gly Lys Asp
20 25 30

Gly Glu Lys Arg His Asn Glu Lys Leu Thr Ser Ser Ile Glu Tyr Lys
35 40 45

Arg His Gly Asp Asp Tyr Ala Lys Tyr Ala Glu Arg Ile Ala Glu Glu
50 55 60

Leu Gln Tyr Tyr Gly Ser Asn Ser Phe Ala Ser Phe Ile Lys Gly Glu
65 70 75 80

Gly Val Leu Tyr Lys Glu Ile Leu Cys Asp Val Cys Asp Lys Leu Lys
85 90 95

Val Asn Tyr Asn Lys Lys Thr Glu Thr Thr Leu Ile Glu Gln Asn Met
100 105 110

Leu Ser Lys Ile Leu Glu Arg Ser Leu Glu Glu Met Asp Asp Glu Glu
115 120 125

Val Lys Glu Met Cys Asp Glu Leu Ser Ile Lys Asn Thr Asp Asn Leu
130 135 140

Asn Arg Gln Ala Leu Ser Ala Ala Thr Leu Thr Leu Phe Lys Met Gly
145 150 155 160

Gly Phe Lys Ser Tyr Gln Leu Ala Val Ile Val Ala Asn Ala Val Ala
165 170 175

Lys Thr Ile Leu Gly Arg Gly Leu Ser Leu Ala Gly Asn Gln Val Leu
180 185 190

Thr Arg Thr Leu Ser Phe Leu Thr Gly Pro Val Gly Trp Ile Ile Thr
195 200 205

Gly Val Trp Thr Ala Ile Asp Ile Ala Gly Pro Ala Tyr Arg Val Thr
210 215 220

Ile Pro Ala Cys Ile Val Val Ala Thr Leu Arg Leu Lys Thr Gln Gln
225 230 235 240

Ala Asn Glu Asp Lys Lys Ser Leu Gln Ile Glu Ser Val
245 250



(2) INFORMATION FOR SEQ ID NO: 786:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 173 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...173

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786

Val Phe Leu Val Gln Ser Trp Ala Leu Ser Leu Lys Ile Asp Ser Leu
1 5 10 15

Phe Ser Leu Phe Ser Val Gly Lys Ile Pro Ser Gly Ser Lys Asp Pro
20 25 30

Phe Ala Leu Arg Arg Leu Ser Phe Gly Leu Leu Lys Ile Ile Ala His

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...340

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812

Met Ile Leu Ser Ile Glu Ser Ser Cys Asp Asp Ser Ser Leu Ala Leu
1 5 10 15

Thr Arg Ile Glu Asp Ala Lys Leu Ile Ala His Phe Lys Ile Ser Gln
20 25 30

Glu Lys His His Ser Ser Tyr Gly Gly Val Val Pro Glu Ile Ala Ser
35 40 45

Arg Leu His Ala Glu Asn Leu Pro Leu Leu Leu Glu Arg Val Lys Ile
50 55 60

Ser Leu Asn Lys Asp Phe Ser Lys Ile Lys Ala Ile Ala Ile Thr Asn
65 70 75 80

Gln Pro Gly Leu Ser Val Thr Leu Ile Glu Gly Leu Met Met Ala Lys
85 90 95

Ala Leu Ser Leu Ser Leu Asn Leu Pro Leu Ile Leu Glu Asp His Leu
100 105 110

Arg Gly His Val Tyr Ser Leu Phe Ile Asn Glu Lys Gln Thr Arg Met
115 120 125

Pro Leu Ser Val Leu Leu Val Ser Gly Gly His Ser Leu Ile Leu Glu
130 135 140

Ala Arg Asp Tyr Glu Asp Ile Lys Ile Val Ala Thr Ser Leu Asp Asp
145 150 155 160

Ser Phe Gly Glu Ser Phe Asp Lys Val Ser Lys Met Leu Asp Leu Gly
165 170 175

Tyr Pro Gly Gly Pro Ile Val Glu Lys Leu Ala Leu Asp Tyr Ala His
180 185 190

Pro Asn Glu Pro Leu Met Phe Pro Ile Pro Leu Lys Asn Ser Pro Asn
195 200 205

Leu Ala Phe Ser Phe Ser Gly Leu Lys Asn Ala Val Arg Leu Glu Val
210 215 220

Glu Lys Asn Ala His Asn Leu Asn Asp Glu Val Lys Gln Lys Ile Gly
225 230 235 240

Tyr His Phe Gln Ser Ala Ala Ile Glu His Leu Ile Gln Gln Thr Lys
245 250 255

Arg Tyr Phe Lys Ile Lys Arg Pro Lys Ile Phe Gly Ile Val Gly Gly
260 265 270

Ala Ser Gln Asn Leu Ala Leu Arg Lys Ala Phe Glu Asp Leu Cys Ala
275 280 285

Glu Phe Asp Cys Glu Leu Val Leu Ala Pro Leu Glu Phe Cys Ser Asp
290 295 300

Asn Ala Ala Met Ile Gly Arg Ser Ser Leu Glu Ala Tyr Gln Lys Lys
305 310 315 320

Arg Phe Ile Pro Leu Glu Lys Ala Asp Ile Ser Pro Arg Thr Leu Leu
325 330 335

Xys Asn Phe Glu
340



(2) INFORMATION FOR SEQ ID NO: 813: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 202 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...202

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819

Met Lys Tyr Leu Trp Leu Phe Leu Ile Tyr Ala Ile Gly Leu Phe Ala
1 5 10 15

Thr Asp Lys Thr Leu Asp Ile Ile Lys Thr Ile Gln Lys Leu Pro Lys
20 25 30

Ile Glu Val Arg [illegible] Ser Ile Asp Asn Asp Ala Asn Tyr Ala Leu Lys
35 40 45

Leu His Glu Val Leu Ala Asn Asp Leu Lys Thr Ser Gln His Phe Asp
50 55 60

Val Ser Gln Asn Lys Glu Gln Gly Ala Ile Asn Tyr Ala Glu Leu Lys
65 70 75 80

Asp Lys Lys Val His Leu Val Ala Leu Val Ser Val Ala Val Glu Asn
85 90 95

Gly Asn Lys Ile Ser Arg Leu Lys Leu Tyr Asp Val Asp Thr Gly Thr
100 105 110

Leu Lys Lys Thr Phe Asp Tyr Pro Ile Val Ser Leu Asp Leu Tyr Pro
115 120 125

Phe Ala Ala His Asn Met Ala Ile Val Val Asn Asp Tyr Leu Lys Ala
130 135 140

Pro Ser Ile Ala Trp Met Lys Arg Leu Ile Val Phe Ser Lys Tyr Ile
145 150 155 160

Gly Pro Gly Ile Thr Asn Ile Ala Leu Ala Asn Tyr Thr Met Arg Tyr
165 170 175

Gln Lys Glu Ile Ile Lys Asn Asn Arg Leu Asn Ile Phe Pro Lys Trp
180 185 190

Ala Asn Ala Glu Gln Thr Glu Phe Tyr Tyr
195 200



(2) INFORMATION FOR SEQ ID NO: 820: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...168 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820 

Met Leu Ser Ser Asn Asp Leu Phe Met Val Val Leu Gly Ala Ile Leu 
1 5 10 15 

Leu Val Leu Val Cys Leu Val Gly Tyr Leu Tyr Leu Lys Glu Lys Glu 
20 25 30 

Phe Tyr His Lys Met Arg Arg Leu Glu Lys Thr Leu Asp Glu Ser Tyr 
35 40 45 

Gln Glu Asn Tyr Leu Tyr Ser Lys Arg Leu Arg Glu Leu Glu Gly Arg 
50 55 60 

Leu Glu Gly Leu Ser Leu Glu Lys Ser Ala Lys Glu Asp Ser Ser Leu 
65 70 75 80 

Lys Thr Thr Leu Ser His Leu Tyr Asn Gln Leu Gln Glu Ile Gln Lys 
85 90 95 

Ser Met Asp Lys Glu Arg Asp Tyr Leu Glu Glu Lys Ile Ile Unk Unk 
100 105 110 

Lys Thr Unk Unk Lys Thr Trp Gly Ile Met Pro Leu Ala Met Lys Ser 
115 120 125 

Thr Glu Lys Gln Val Leu Lys Met Tyr Gln Glu Gly Tyr Ser Val Asp 
130 135 140 

Ser Ile Ser Lys Glu Phe Lys Val Ser Lys Gly Glu Val Glu Phe Ile 
145 150 155 160 

Leu Asn Met Ala Gly Leu Lys Trp 
165 
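A listing of this kind can be checked mechanically against its declared LENGTH. The following is a minimal sketch, assuming Python; the names `THREE_TO_ONE`, `to_one_letter` and `check_length` are hypothetical and are not part of the listing. It converts three-letter residue codes to one-letter codes, skips the position numbers, and rejects any token that is not a recognised code, which is one way to catch misreadings such as "He" for Ile or "Gin" for Gln. Mapping the listing's "Unk" marker to "X" is an assumption made for illustration only.

```python
# Three-letter to one-letter residue codes. "Unk" is the listing's marker for
# an unknown residue; mapping it to "X" is an assumption of this sketch.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Unk": "X",
}


def to_one_letter(listing: str) -> str:
    """Convert a three-letter residue listing to a one-letter string.

    Purely numeric tokens (the position markers printed under each row)
    are ignored. Any other unrecognised token raises ValueError.
    """
    tokens = [t for t in listing.split() if not t.isdigit()]
    bad = [t for t in tokens if t not in THREE_TO_ONE]
    if bad:
        raise ValueError(f"unrecognised residue codes: {bad}")
    return "".join(THREE_TO_ONE[t] for t in tokens)


def check_length(listing: str, declared_length: int) -> bool:
    """Return True when the residue count equals the declared LENGTH."""
    return len(to_one_letter(listing)) == declared_length


# First sixteen residues of SEQ ID NO: 820, with the residue printed as "He"
# read as Ile (an assumption about the intended code).
first_row = """Met Leu Ser Ser Asn Asp Leu Phe Met Val Val Leu Gly Ala Ile Leu
1 5 10 15"""

print(to_one_letter(first_row))     # MLSSNDLFMVVLGAIL
print(check_length(first_row, 16))  # True
```

Applied to a complete listing, `check_length(listing, 168)` would be the corresponding check for SEQ ID NO: 820, whose declared length is 168 amino acids.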







(2) INFORMATION FOR SEQ ID NO: 820: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...168 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820 



Met Leu Ser Ser Asn Asp Leu Phe Met Val Val Leu Gly Ala Ile Leu 
1 5 10 15 

Leu Val Leu Val Cys Leu Val Gly Tyr Leu Tyr Leu Lys Glu Lys Glu 
20 25 30 

Phe Tyr His Lys Met Arg Arg Leu Glu Lys Thr Leu Asp Glu Ser Tyr 
35 40 45 

Gln Glu Asn Tyr Leu Tyr Ser Lys Arg Leu Arg Glu Leu Glu Gly Arg 
50 55 60 

Leu Glu Gly Leu Ser Leu Glu Lys Ser Ala Lys Glu Asp Ser Ser Leu 
65 70 75 80 

Lys Thr Thr Leu Ser His Leu Tyr Asn Gln Leu Gln Glu Ile Gln Lys 
85 90 95 

Ser Met Asp Lys Glu Arg Asp Tyr Leu Glu Glu Lys Ile Ile Unk Unk 
100 105 110 

Lys Thr Unk Unk Lys Thr Trp Gly Ile Met Pro Leu Ala Met Lys Ser 
115 120 125 

Thr Glu Lys Gln Val Leu Lys Met Tyr Gln Glu Gly Tyr Ser Val Asp 
130 135 140 

Ser Ile Ser Lys Glu Phe Lys Val Ser Lys Gly Glu Val Glu Phe Ile 
145 150 155 160 
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260 



265 



(2) INFORMATION FOR SEQ ID NO: 880: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
(B) LOCATION 1...363 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 

Met Lys Phe Phe Leu Leu Lys Lys Phe Ser Unk Phe Leu Asn Thr Gln 
1 5 10 15 

Thr His Phe Asn Leu Lys Arg Leu Asn Ala Ser Ser Phe Leu Leu Glu 
20 25 30 

Thr Phe Ser Lys Glu Lys His Ala Phe Val Val Asp Leu Ser Ala Pro 
35 40 45 

Tyr Ile Gly Leu Ser Lys Lys Pro Pro Glu Ser Val Leu Lys Asn Thr 
50 55 60 

Leu Ala Leu Asp Phe Cys Leu Asn Lys Phe Thr Lys Asn Ala Lys Ile 
65 70 75 80 

Leu Gln Ala Asn Val Ile Asp Asn Asp Arg Ile Leu Glu Ile Lys Gly 
85 90 95 

Ala Lys Asp Leu Ala Tyr Lys Ser Glu Thr Phe Ile Leu Arg Leu Glu 
100 105 110 

Met Ile Pro Lys Lys Ala Asn Leu Met Ile Leu Asp Gln Glu Lys Cys 
115 120 125 

Val Ile Glu Ala Phe Arg Phe Asn Asp Arg Val Ala Lys Asn Asp Ile 
130 135 140 

Leu Gly Ala Leu Pro Pro Asn Ile Tyr Glu His Gln Glu Glu Asp Leu 
145 150 155 160 

Asp Phe Lys Gly Leu Leu Asp Ile Leu Glu Lys Asp Phe Leu Ser Tyr 
165 170 175 

Gln His Lys Glu Leu Glu His Lys Lys Asn Gln Ile Ile Lys Arg Leu 
180 185 190 

Asn Ala Gln Lys Glu Arg Leu Lys Glu Lys Leu Glu Lys Leu Glu Asp 
195 200 205 

Pro Lys Thr Leu Gln Leu Glu Ala Lys Glu Leu Gln Thr Gln Ala Ser 
210 215 220 

Leu Leu Leu Thr Tyr Gln His Leu Ile Asn Arg Arg Glu Asn Arg Val 
225 230 235 240 

Ile Leu Lys Asp Phe Glu Asp Lys Glu Cys Met Ile Glu Ile Asp Lys 
245 250 255 

Ser Met Pro Leu Asn Ala Phe Ile Asn Lys Lys Phe Thr Leu Ser Lys 
260 265 270 

Lys Lys Lys Gln Lys Ser Gln Phe Leu Tyr Leu Glu Glu Glu Asn Leu 
275 280 285 

Lys Glu Lys Ile Ala Phe Lys Glu Asn Gln Ile Asn Tyr Val Arg Asp 
290 295 300 

Ala Ala Glu Glu Ser Val Leu Glu Met Phe Met Pro Val Lys Asn Ser 
305 310 315 320 

Lys Ile Lys Arg Pro Met Asn Gly Tyr Glu Val Leu Tyr Tyr Lys Asp 
325 330 335 

Unk Lys Unk Gly Leu Gly Lys Thr Lys Lys Arg Ile Ser Ser Phe Tyr 
340 345 350 

Lys Thr Gln Unk Arg Met Ile Unk Gly Cys Unk 
355 360 



(2) INFORMATION FOR SEQ ID NO:880: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...363 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 

Met Lys Phe Phe Leu Leu Lys Lys Phe Ser Unk Phe Leu Asn Thr Gln 

Thr His Phe Asn Leu Lys Arg Leu Asn Ala Ser Ser Phe Leu Leu Glu 
20 25 30 

Thr Phe Ser Lys Glu Lys His Ala Phe Val Val Asp Leu Ser Ala Pro 
35 40 45 

Tyr Ile Gly Leu Ser Lys Lys Pro Pro Glu Ser Val Leu Lys Asn Thr 
50 55 60 

Leu Ala Leu Asp Phe Cys Leu Asn Lys Phe Thr Lys Asn Ala Lys Ile 
65 70 75 80 

Leu Gln Ala Asn Val Ile Asp Asn Asp Arg Ile Leu Glu Ile Lys Gly 
85 90 95 

Ala Lys Asp Leu Ala Tyr Lys Ser Glu Thr Phe Ile Leu Arg Leu Glu 
100 105 110 

Met Ile Pro Lys Lys Ala Asn Leu Met Ile Leu Asp Gln Glu Lys Cys 
115 120 125 

Val Ile Glu Ala Phe Arg Phe Asn Asp Arg Val Ala Lys Asn Asp Ile 
130 135 140 

Leu Gly Ala Leu Pro Pro Asn Ile Tyr Glu His Gln Glu Glu Asp Leu 
145 150 155 160 

Asp Phe Lys Gly Leu Leu Asp Ile Leu Glu Lys Asp Phe Leu Ser Tyr 
165 170 175 

Gln His Lys Glu Leu Glu His Lys Lys Asn Gln Ile Ile Lys Arg Leu 
180 185 190 

Asn Ala Gln Lys Glu Arg Leu Lys Glu Lys Leu Glu Lys Leu Glu Asp 
195 200 205 

Pro Lys Thr Leu Gln Leu Glu Ala Lys Glu Leu Gln Thr Gln Ala Ser 
210 215 220 

Leu Leu Leu Thr Tyr Gln His Leu Ile Asn Arg Arg Glu Asn Arg Val 
225 230 235 240 

Ile Leu Lys Asp Phe Glu Asp Lys Glu Cys Met Ile Glu Ile Asp Lys 
245 250 255 

Ser Met Pro Leu Asn Ala Phe Ile Asn Lys Lys Phe Thr Leu Ser Lys 
260 265 270 

Lys Lys Lys Gln Lys Ser Gln Phe Leu Tyr Leu Glu Glu Glu Asn Leu 
275 280 285 

Lys Glu Lys Ile Ala Phe Lys Glu Asn Gln Ile Asn Tyr Val Arg Asn 
290 295 300 

Ala Ala Glu Glu Ser Val Leu Glu Met Phe Met Pro Val Lys Asn Ser 
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(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...233 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 



Leu 


Ser Leu 


Met 


Unk 


Val 


Leu 


Asn 


Ala 


Lys Glu Cys Val Unk 


Pro He 


1 






5 








Gin 


10 


15 

mi 


Thr 


Arg Ser 


Val 


Lys 


Tyr 


His 


Gin 


Ser Ala Glu He Ara 


A. la Tiftii 






20 










25 


3D 




Gin 


Leu Gin 


Ser 


Tyx 


Lys 


Met 


Ala 


Lys 


Met Ala Leu Asp Asn 






35 










40 




45 

mi 




lys 


Leu Val 


Lys 


Asp 


Lys 


Lys 


Pro 


Ala 


Val He Leu Asn' Lon 

w»** **• U«U ASM JJOU 


Asp Glu 




50 








55 






9 W 


Thr 


Val Leu 


Asn. 


Thr 


Phe 


Asp 


Tvr 


Ala 


Glv Tvr Leu Val T.vs 

^* mm J mm, J m* *J9M VBA wmi T Si 


Asn Cys 


65 








70 












lie 


Lvs Tvr 


Thr 


Pro 


Glu 


Thr 




Asp 


ii/s rue ulu liys vvJLU 


Gly Ser 








85 










90 


95 


Leu 




lie 


Pro 

* A w 


Glv 


Ala 


T.Alt 


Asp 


jrne jjeu \>iu uyr Ala 


Asn Ser 






100 










105 


110 




Lys 


Gly Val 


Lys 


He 


Phe 


Tyr 


He 


Ser 


Asn. Aircr TViy" ft! n TA*e 
<uy xaf win jjys 


Asn Lys 




115 










120 




125 


Ala Phe Thr Leu Lys Thr Leu Lys Ser Phe Lys Leu Pro Gln Val Ser 
130 135 140 

Glu Glu Ser Val Leu Leu Lys Glu Lys Gly Lys Pro Lys Ala Val Arg 
145 150 155 160 

Arg Glu Leu Val Ala Lys Asp Tyr Ala Ile Val Leu Gln Val Gly Asp 
165 170 175 

Thr Leu His Asp Phe Asp Ala Ile Phe Ala Lys Asp Ala Lys Asn Ser 
180 185 190 

Gln Glu Gln Gln Ala Lys Val Leu Gln Asn Ala Gln Lys Phe Gly Thr 
195 200 205 

Glu Trp Ile Ile Leu Pro Asn Ser Leu Tyr Gly Thr Trp Glu Asp Gly 
210 215 220 

Pro Ile Lys Ala Trp Gln Asn Lys Lys 
225 230 










(2) INFORMATION FOR SEQ ID NO: 866: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...239 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866 



Met Ile Tyr Gly Val Leu Asp Gly Leu Phe Leu Ala Ile Leu Gln Ala 
1 5 10 15 



RESULT 1 
AAW20486 

ID   AAW20486 standard; protein; 253 AA. 
XX 
AC   AAW20486; 
XX 
DT   29-JUL-1997 (first entry) 
XX 
DE   H. pylori cytoplasmic protein. 
XX 
KW   Cytoplasmic; vaccine; prevention; treatment; infection; [illegible]; 
KW   binding compound; bacterium; life cycle; [illegible]; inhibitor; 
KW   duodenal ulcer disease; chronic gastritis; diagnosis; envelope. 
XX 
OS   Helicobacter pylori. 
XX 
PN   WO9640893-A1. 
XX 
PD   19-DEC-1996. 
XX 
PF   06-JUN-1996; 96WO-US09122. 
XX 
PR   01-APR-1996; 96US-0630405. 
PR   07-JUN-1995; 95US-0487032. 
XX 
PA   (ASTR ) ASTRA AB 
XX 
PI   Berglindh OT, Smith D, Mellgaerd BL; 
XX 
DR   WPI; 1997-052306/05. 
DR   N-PSDB; AAT67811. 
XX 
PT   Helicobacter pylori nucleic acid sequences and related 
PT   polypeptide(s) - useful for vaccines to treat or prevent H. pylori 
PT   infection, and to detect Helicobacter 
XX 
PS   Claim 61; Page 651; 1481pp; English. 
XX 
CC   The present sequence is a H. pylori cytoplasmic protein. 
CC   The protein may be used in a vaccine to prevent or treat H. pylori 
CC   infection or to identify H. pylori polypeptide binding compounds, 
CC   useful as potential H. pylori life cycle activators or inhibitors. 
CC   The genomic sequence of H. pylori (ATCC 55679) was determined from 
CC   overlapping contigs generated by mechanically shearing the bacterial 
CC   DNA. The sequences were analysed for ORF of at least 180 nucleotides, 
CC   and the predicted coding regions defined by computer evaluation. To 
CC   identify likely H. pylori antigens for vaccine development, the amino 
CC   acid sequences predicted from various ORF were analysed for significant 
CC   homology to other known or exported membrane proteins. Having identified 
CC   and determined the sequences of interest, particular regions can be 
CC   isolated from H. pylori by PCR amplification for recombinant polypeptide 
CC   production, e.g. in E. coli hosts. 
XX 
SQ   Sequence 253 AA; 

Query Match 99.3%; Score 1270; DB 18; Length 253; 
Best Local Similarity 99.2%; Pred. No. 6e-118; 
Matches 251; Conservative 1; Mismatches 1; Indels 0; Gaps 

Qy   1 MAYKYDRDLEFLKQLESSDLLDLFEVLVFGKDGEKRHNEKLTSSIEYKRHGDDYAKYAER 60 
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db   1 maykydrdleflkqlessdlldlfevlvfgkdgekrhnekltssleykrhgddyakyaer 60 

Qy  61 IAEELQYYGSNSFASFIKGEGVLYKEILCDVCDKLKVNYNKKTETTLIEQNMLSKILERS 120 
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db  61 iaeelqyygsnsfasfikgegvlykeilcdvcdklkvnynkktettlieqnmlskilers 120 

Qy 121 LEEMDDEEVKEMCDELSIKNTDNLNRQALSAATLTLFKMGGFKSYQLAVIVANAVAKTIL 180 
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db 121 leemddeevkemcdelsikntdnlnrqalsaatltlfkmggfksyqlavivanavaktil 180 

Qy 181 GRGLSLAGNQVLTRTLSFLTGPVGWIITGVWTAIDIAGPAYRVTIPACIVVATLRLKTQQ 240 
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db 181 grglslagnqvltrtlsfltgpvgwiitgvwtaidiagpayrvtipacivvatlrlktqq 240 

Qy 241 ANGDKKSLQIESI 253 
       || |||||||||: 
Db 241 anedkkslqiesv 253 
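The reported figures can be cross-checked with simple arithmetic. The sketch below, assuming Python, counts identical and non-identical positions in an ungapped aligned block and reproduces the 99.2% figure as 251 matches over a length of 253. The helper names `alignment_counts` and `percent` are hypothetical, and treating "Best Local Similarity" as matches divided by alignment length is an assumption about how the search program computed it, not something the report states.

```python
def alignment_counts(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, non_identical) positions for two aligned strings.

    The comparison ignores case, since the report prints the query in
    upper case and the database entry in lower case. Gapped alignments
    are not handled by this sketch.
    """
    if len(query) != len(subject):
        raise ValueError("aligned blocks must have equal length")
    identical = sum(q.upper() == s.upper() for q, s in zip(query, subject))
    return identical, len(query) - identical


def percent(matches: int, length: int) -> float:
    """Percentage of matching positions, rounded to one decimal place."""
    return round(100.0 * matches / length, 1)


# Final block of the alignment (positions 241 to 253).
print(alignment_counts("ANGDKKSLQIESI", "anedkkslqiesv"))  # (11, 2)

# Totals as reported: 251 matches over an alignment length of 253.
print(percent(251, 253))  # 99.2
```

The two non-identical positions found in the final block (G against e, and I against v) agree with the report's count of one mismatch and one conservative substitution.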
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NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO . 
HELICOBACTER PYLORI FOR DIAGNOSTICS AND THERAPEUTICS 

Background of the Invention 

Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that 
was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and B. 
Marshall, (1983) Lancet I: 1273-1275; and Marshall et al., (1984) Microbios Lett. 25: 
83-88). H. pylori has been strongly linked to chronic gastritis and duodenal ulcer disease. 
(Rathbone et al., (1986) Gut 27: 635-641). Moreover, evidence is accumulating for an 
etiologic role of H. pylori in nonulcer dyspepsia, gastric ulcer disease, and gastric 
adenocarcinoma. (Blaser M. J., (1993) Trends Microbiol. 1: 255-260). Transmission of the 
bacteria occurs via the oral route, and the risk of infection increases with age. (Taylor, 
D.N. and M. J. Blaser, (1991) Epidemiol. Rev. 13: 42-50). H. pylori colonizes the human 
gastric mucosa, establishing an infection that usually persists for decades. Infection by 
H. pylori is prevalent worldwide. Developed countries have infection rates over 50% of the 
adult population, while developing countries have infection rates reaching 90% of the 
adults over the age of 20. (Hopkins R. J. and J. G. Morris (1994) Am. J. Med. 97: 265-277). 

The bacterial factors necessary for colonization of the gastric environment, and for 
virulence of this pathogen, are poorly understood. Examples of the putative virulence 
factors include the following: urease, an enzyme that may play a role in neutralizing gastric 
acid pH (Eaton et al., (1991) Infect. Immunol. 59: 2470-2475; Ferrero, R.L. and A. Lee 
(1991) Microb. Ecol. Hlth. Dis. 4: 121-134; Labigne et al., (1991) J. Bacteriol. 173: 
1920-1931); the bacterial flagellar proteins responsible for motility across the mucous layer 
(Hazell et al., (1986) J. Inf. Dis. 153: 658-663; Leying et al., (1992) Mol. Microbiol. 6: 
2863-2874; and Haas et al., (1993) Mol. Microbiol. 8: 753-760); Vac A, a bacterial toxin 
that induces the formation of intracellular vacuoles in epithelial cells (Schmitt, W. and R. 
Haas, (1994) Molecular Microbiol. 12(2): 307-319); and several gastric tissue-specific 
adhesins. (Boren et al., (1993) Science 262: 1892-1895; Evans et al., (1993) J. Bacteriol. 
175: 674-683; and Falk et al., (1993) Proc. Natl. Acad. Sci. USA 90: 2035-203). 

Numerous therapeutic agents are currently available that eradicate H. pylori 
infections in vitro. (Huesca et al., (1993) Zbl. Bakt. 280: 244-252; Hopkins, R. J. and J. G. 
Morris, supra). However, many of these treatments are suboptimally effective in vivo 
because of bacterial resistance, altered drug distribution, patient non-compliance or poor 
drug availability. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics 
combined with bismuth is part of the standard regime used to treat H. pylori infection. 
(Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics 15 Supp. B: 
37-48). Recently, combinations of a proton pump inhibitor and a single antibiotic have 
been shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E. 
Dominguez-Munoz, supra). However, methods employing antibiotic agents can have the problem of the 
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